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Foreword 



Through his voluminous and influential writings, editorial activities, organiza- 
tional leadership, intellectual acumen, and strong sense of history, Clifford Am- 
brose Truesdell III (1919-2000) was the main architect for the renaissance of ra- 
tional continuum mechanics since the middle of the twentieth century. The present 
collection of 42 essays and research papers pays tribute to this man of mathematics, 
science, and natural philosophy as well as to his legacy. 

The first five essays by B.D. Coleman, E. Giusti, W. Noll, J. Serrin, and 
D. Speiser were texts of addresses given by their authors at the Meeting in memory 
of Clifford Truesdell, which was held in Pisa in November 2000. In these essays the 
reader will find personal reminiscences of Clifford Truesdell the man and of some 
of his activities as scientist, author, editor, historian of exact sciences, and principal 
founding member of the Society for Natural Philosophy. 

The bulk of the collection comprises 37 research papers which hear witness to 
the Truesdellian legacy. These papers cover a wide range of topics; what ties them 
together is the rational spirit. Clifford Truesdell, in his address upon receipt of a 
Birkhoff Prize in 1978, put the essence of modern continuum mechanics succinctly 
as “ conceptual analysis, analysis not in the sense of the technical term but in the 
root meaning: logical criticism, dissection, and creative scrutiny.” It is in celebra- 
tion of this spirit and this essence that these research papers are dedicated to the 
memory of their hearer, driving force, and main promoter for half a century. Most 
of these papers were presented at the Symposium on Recent Advances and New 
Directions in Mechanics, Continuum Thermodynamics, and Kinetic Theory - In 
Memory of Clifford A. Truesdell III, held in Blacksburg, Virginia, in June 2002; 
parts of two papers were delivered at the meeting Remembering Clifford Truesdell, 
held in Turin in November 2002; and the rest was written especially for the present 
collection. 

The portrait, a photo of which serves as the frontispiece of this collection, 
adorns the Clifford A. Truesdell III Room of History of Science in the library of 
the Scuola Normale Superiore (Pisa, Italy), which was inaugurated in October 2003 
and permanently houses Clifford Truesdell’s previously private collection of books, 
papers, and correspondence. We arc grateful to Mrs. Charlotte Truesdell for helping 
us secure a digital file of this photo and for providing us with the list of published 
works of Clifford Truesdell. 
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Published Works of 
Clifford Ambrose Truesdell III * 



The year of publication is omitted from the entry unless it differs from the year 
under which the entry is listed. Letters following a number indicate subsidiary 
separate publications, as follows: 

P Preliminary report or preprint, 

A Abstract, separately published or only published version, 

C Condensed or extracted version, 

L Lecture concerning part or all of the contents of main entry, 

R Reprint, entire, 

RE Reprint of an extract, 

T Translation, entire, 

TC Translation, condensed, 

TE Translation of an extract. 

The list excludes some 600 reviews published between 1949 and 1971 in Math- 
ematical Reviews, Applied Mechanics Reviews, Zentralblatt fur Mathematik, In- 
dustrial Laboratories, and Mathematics of Computation but includes reviews pub- 
lished in other journals. 

1943 

1. (Co-author P. Nemenyi) A stress function for the membrane theory of shells 
of revolution. Proceedings of the National Academy of Sciences (U.S.A.) 29, 
159-162. 

Other publication in 1943: No. 3A1. 

1944 

2. Alonzo Church, Introduction to Mathematical Logic, Part I, Notes by 
C.A. Truesdell, Annals of Mathematics Studies No. 13, Princeton, Uni- 
versity Press, vi + 118 pp. 

* Note by the editors: This list and the list on p. 29 are slightly edited versions of those that we 
received from Mrs. C. Truesdell, to whom we are heartily grateful. In our editorial work we have 
added a few entries, updated several items, and made a small number of other minor con'ections. To 
G.P. Galdi, K. Hutter, R.G. Muncaster. F. Pastrone, and D. Speiser, we are beholden for their help in 
tracking down article titles and numbers of journal volumes. In what follows, explanatory remarks 
set off by square brackets were made by Clifford Truesdell himself. 
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Member of the Editorial Council, Bollettino di Storia delle Scienze Matema- 
tiche, Unione Matematica Italiana, 1979-2000. 

Member of the Editorial Board, Speculations in Science and Technology, 1980- 
1987. 

Member of the Editorial Board, Ganita-Bhdrati, 1981-1993. 

Member of the Editorial Board, Stability and Applied Analysis of Continuous 
Media, 1991-1993. 
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Eulogium 



CLIFFORD AMBROSE TRUESDELL III 
( b. February 18, 1919; d. January 14, 2000) 



Clifford Ambrose Truesdell III died on January 14, 2000. This man of mathematics, 
science and natural philosophy focused his strong sense of history and his talents 
and taste for identifying major advances in rational mechanics to establish a re- 
naissance in mechanics and materials research that has prospered since the middle 
of the 20th century. He contributed substance and spirit to the areas of continuum 
mechanics, thermodynamics and kinetic theory, challenged the establishment and 
its dogmatic thinking, and engaged the community of young researchers with a new 
and fundamental direction of inquiry which concentrated on foundations, structure 
and logical implication. His letters, his books, his essays, his university courses, his 
co-founding of the Journal for Rational Mechanics and Analysis, his founding of 
the Archive for Rational Mechanics and Analysis, the Archive for History of Exact 
Sciences, Springer Tracts in Natural Philosophy , the Society for Natural Philoso- 
phy and his support of the research of other scientists were exceptional. His joint 
encyclopedic articles, The Classical Field Theories in 1960 and The Non-Linear 
Field Theories of Mechanics in 1965, were masterful, erudite, comprehensive, and 
pioneering works of lasting value. 

Clifford Truesdell was a scholar of immense creativity, a linguist, a connoisseur 
of the arts and a historian unfettered by fashion. Throughout his life, he taught 
us to preserve scholarship, to question foundations and to follow a path of reason 
with principle, purpose and passion. His actions constantly provided a stimulus, 
an environment and a framework for scientific discovery. He made a profound 
contribution to our science. 



Roger Fosdick 
University of Minnesota 
Minneapolis 
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NJ 08854-8058, U.S.A. 
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Below is a shortened version of the text of a talk given at the Meeting in memory 
of Clifford Truesdell held in Pisa in November of 2000 and at the Symposium on 
Recent Advances and New Directions in Mechanics, Continuum Thermodynamics, 
and Kinetic Theory held in Blacksburg in June of 2002. Appended to that text is 
the Curriculum Vita of Professor Truesdell as he kept it up-to-date until October 
1993, at which time, with his approval, I had it transcribed into its present format. 



Clifford Truesdell and Thermodynamics 

I consider myself to have been among the most fortunate of men: I have had a 
teacher and friend, indeed, more than a friend, in effect, an elder brother, who was 
the leading scholar in my science and who gave me encouragement, sound advice, 
and every type of help that I might need, even when I did not know that I needed 
it. Most important of all, he taught me that careful scholarship and the persistent 
search for insight and understanding are far more important than facile skill in the 
use of contemporary techniques for the solution of currently popular problems. 

Clifford Ambrose Truesdell III was born in Los Angeles, February 18, 1919. In 
his 23rd and 24th years he received, from the California Institute of Technology, 
the B.S. Degree in Mathematics, the B.S. Degree in Physics, and the M.S. Degree 
in Mathematics, and, in addition, from Brown University, a Certificate in Mechan- 
ics. In his 25th year he received, from Princeton University, the Ph.D. Degree in 
Mathematics. 

In the course of his career he received numerous awards and prizes, among 
which are: the Euler medal of the USSR Academy of Sciences, which was received 
twice, in 1958 and 1983, the Bingham Medal of the Society of Rheology, the 
Panetti Prize and Gold Medal of the Accademia di Scienze di Torino, the Birkhoff 
Prize of the American Mathematical Society and the Society for Industrial and 
Applied Mathematics, and the Ordine del Cherubino of the Universita di Pisa. He 
received honorary doctorates from five universities and was awarded membership 
in twelve international academies of science; among them is the illustrious Italian 
Accademia Nazionale dei Lincei. 



l 

C.-S. Man and R.L. Fosdick (eds.), The Rational Spirit in Modern Continuum Mechanics, 1 - 13 . 
© 2004 Kluwer Academic Publishers. Printed in the Netherlands. 
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Although I was an undergraduate student at Indiana University from February 
1948 to June 1951, and hence my stay in Bloomington Indiana did overlap, albeit 
partially, that of Clifford and Charlotte Truesdell, we first met years later, in the 
Spring of 1958, at a scientific meeting in Lancaster Pennsylvania on the subject 
of rheology. The meeting was followed by an exchange of letters about thermody- 
namics, which was in turn followed by a one-week visit with Clifford and Charlotte 
Truesdell at their house in Bloomington in the winter of 1958 and not long after 
that by his two week visit to the Mellon Institute. If you hear with me, I should 
like to tell you about events of that period from the point of view of one whose 
subsequent view of science, the aids, and life itself were completely changed by his 
interaction with Clifford Truesdell. 

In the summer of 1957 I left a position in the chemical industry to become a 
Senior Fellow of the Mellon Institute in Pittsburgh, and soon after my arrival I 
started to attend courses given by Walter Noll on continuum mechanics and related 
branches of mathematics. Before the academic year was over, we both went to 
the rheology meeting in Lancaster. The list of speakers for that meeting included 
Clifford Truesdell, Walter Noll, Jerry Ericksen, and Ronald Rivlin. At a luncheon 
that was held there, Walter Noll and I were sitting at a table with several persons 
other than those just mentioned, and I expressed the view that although we were all 
told in school that thermodynamics is a closed subject whose general principles arc 
known and pertain to only equilibrium states or to processes that stay so close to 
equilibrium that all departures from equilibrium arc governed by linear constitutive 
relations, I could not believe that such is the case, and I felt that our knowledge of 
what the science of thermodynamics could be was in some way analogous to what 
the cultivators of mechanics knew about their subject at the time of the publication 
of Newton’s Principia and the early work of the Basel School. All within healing, 
with the exception of Walter Noll, agreed with each other that I was wrong. Walter 
agreed with me and suggested that I read certain papers of Clifford Truesdell and 
that we talk more when we were back in Pittsburgh. 

We did talk more, much more, and I, with my eyes open wide with excitement, 
read whatever I could of Clifford Truesdell’s writings on thermodynamics. 

There were papers that carried forward Maxwell’s idea that a properly formu- 
lated theory of diffusion of mass in fluid mixtures should account, as Fick’s Law 
does not, for balance of linear and angular momentum. 

There were passages decrying the vagueness that rendered nearly empty, at least 
for mathematicians, the text-book versions of the Second Law of Thermodynamics. 
Among these was a footnote to a discussion of classical thermodynamics in his 
paper, The Mechanical Foundations of Elasticity and Fluid Dynamics, published 
in 1952 in Volume 1 of the Journal for Rational Mechanics and Analysis. The foot- 
note urges the rational student “to cleave the stinging fog of pseudo-philosophical 
mysticism” hiding the mathematics behind a certain formulation of the Second 
Law. It was clear that he saw that thermodynamics, far from being a closed subject, 
was in a terrible state. 
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Today we know that he was then doing research that would supply the key to 
setting things straight. In that paper of 1952 there appeal's a preliminary version of 
what he, with great generosity, called the Clausius-Duhem inequality , and which 
appeared in its present form in The Classical Field Theories of Mechanics, by 
Clifford Truesdell and Richard Toupin, published in 1960 in the Handbuch der 
Physik, Vol. Ill: 




Here H is the total entropy of the part IP, 6 is the thermodynamical temperature, 
q is the inward directed heat flux, r is the supply of heat from external sources, and 
n is the outward directed unit normal vector. 

On reading those two works one sees that before 1960 it was clear to Clifford 
Truesdell that that inequality is the correct mathematical form of the second-law 
of thermodynamics for the materials or systems such that the total entropy H is 
an integral over IP of an entropy density rj, and nearly all the thermodynamical 
systems that we consider in continuum physics have that property. The question 
that seemed open at the time was the following: How does one use the inequality? 
Is it a restriction on the process, or a relation to be obeyed by all processes? 

In the early 1960's Walter Noll put to me the idea, as if it should be obvious to 
every one, that the inequality is a restriction on all processes that arc admissible in 
the material of which the body is composed, and, because one defines each material 
by giving a set of constitutive relations, the Clausius-Duhem inequality, as it must 
hold for all processes compatible with those relations, becomes a restriction on 
constitutive relations. In another act of great generosity, Walter suggested that we 
develop the idea together. It took awhile to sort the argument out and to present it 
in a way that would convince the wary. The paper was written while he and I were 
on sabbatical leave and were guests of Clifford Truesdell at Hopkins. 

Shortly thereafter, I used this approach to the Clausius-Duhem inequality to 
render mathematical ideas I had been struggling to express for years about thermo- 
dynamical restrictions on materials with gradually fading memory. 

Many of the people in this room have done research on the Clausius-Duhem 
inequality, in the study of its implications for new classes of constitutive relations, 
in the study of its implications for the theory of the evolution of singular surfaces, 
or in the study of its logical relation to other mathematically precise statements of 
the second law. I am certain that I do not exaggerate when I say that every one 
of them feels a deep debt of gratitude to Clifford Truesdell for finding the tool to 
cleave the fog that once obscured the science of thermodynamics.* But we have 
even greater debts to him. 

I should like to return to the time of that rheology meeting in Lancaster and elab- 
orate on my own debt to him. A few months after that meeting, I received a letter 
from Clifford in which he said that he heard from Walter Noll that I had studied at 

* The remark was appropriate to both the meeting in Pisa and symposium in Blacksburg. 
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Yale, had read some of the works of J. Willard Gibbs, and knew some things about 
thermodynamics. He then asked if I could clarify some passages in Gibbs’ paper on 
The Equilibrium of Heterogeneous Substances. With Walter’s help I studied those 
passages for months and finally sent off a long letter in which Clifford’s questions 
were answered but issues were raised which for decades affected my work on the 
stability of thermodynamical systems. In his reply he invited me to visit him and 
Charlotte in Bloomington. Many years later, in the summer of 1993, 1 wrote to him 
about that period and with your indulgence I shall read from the letter. I know there 
arc others here who could say similar things about our beloved friend, and when I 
have finished you will see what I mean about our greater debt to him. What I read 
now arc three brief consecutive paragraphs out of a long letter. 

“My correspondence with you about Gibbs’ conditions for the stability of fluid 
phases occurred in this happy period. I visited your house in Bloomington for 
a week in, I believe, the winter of 58-59. That visit had a major influence 
on my view of what is important. As if struck by lightening, like the one 
Christians call Paul, I suddenly saw clearly something for which I was ready 
by instinct. In my case it was not a new religion, but a way to get out of a rut, 
by seeking to study the languages, the writings, the art, the customs, the lives, 
and the music and diversions of the ages in which our science originated and 
the works we admire were produced. Your example gave me the impetus to 
try to learn properly other languages. I became serious in my study of Italian. 
In later years I have tried to improve my French. (In my 62nd year, I started 
working on Attic and Homeric Greek, but was too old for such efforts.)” 

“The influence of our friendship on my intellectual development has been too 
great to describe in a few phrases couched in generalities. A brief summary is 
impossible. Only examples will do, and there is space here for only one.” 

“That one concerns my behavior when I am writing something for publication. 
Invariably, upon completion of a passage I put down the pencil and ask myself: 
‘What would Clifford say if he saw what I have just written?’ The subsequent 
imagined conversation is often such that I feel obliged to rewrite the passage.” 

Would that he were here now, to help us live up to the standards of scholarship 
and clarity that he set for us ! 




MEMORIES OF CLIFFORD TRUESDELL 



5 



Curriculum Vita of Clifford Ambrose Truesdell III 

Born in Los Angeles, California, February 18, 1919 

Studies 

European Travel and Private Study, 1936-1938. 

California Institute of Technology: B.S. (Mathematics), 1941; B.S. (Physics), 
1941; M.S. (Mathematics), 1942. 

Brown University: Certificate in Mechanics, 1942. 

Princeton University: Ph.D. (Mathematics), 1943. 

Primary Employment 

California Institute of Technology: 

Assistant in history, debating, and mathematics, 1940-1942. 

Brown University: 

Assistant in Mechanics, 1942. 

Princeton University: 

Instructor of Mathematics, 1942-1943. 

University of Michigan: 

Instructor of Mathematics, 1943-1944. 

Radiation Laboratory, Massachusetts Institute of Technology: 

Staff Member, 1944-1946. 

U.S. Naval Ordnance Laboratory, White Oak, Maryland: 

Chief, Theoretical Mechanics Subdivision, 1946-1948. 

U.S. Naval Research Laboratory, Washington, D.C.: 

Head, Theoretical Mechanics Section, 1948-1950. 

Indiana University: 

Professor of Mathematics, 1950-1961. 

Johns Hopkins University: 

Professor of Rational Mechanics, 1961-1989; Emeritus, 1989- 

Part-time, Temporary, and Visiting Appointments 

University of Maryland, College Park: 

Lecturer in Mathematics, 1946-1947, 

Assistant Professor of Mathematics, 1947-1949, 

Associate Professor of Mathematics, 1949-1950. 

U.S. Naval Research Laboratory, Washington, D.C.: 

Consultant, 1951-1955. 

Universitat Marburg an der Lahn: 

Gastprofessor, 1957. 

Mathematics Research Center, U.S. Army, University of Wisconsin, Madison: 
Member, 1958. 
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Mellon Institute, Pittsburgh: Visitor, 1959. 

Socony-Mobil Research Laboratory, Dallas, Texas: 

Colloquium Lecturer, 1960. 

U.S. National Bureau of Standards, Washington, D.C.: 

Consultant, 1950-1962. 

University of California at Los Angeles: Special Lecturer, 1963. 

Technische Universitat Berlin-Charlottenburg: Gastprofessor, 1964. 

University of Washington, Seattle: 

Walker-Ames Professor, 1964. 

Australian Mathematical Society Summer Research Institute, Melbourne: 
Lecturer, 1965. 

Syracuse University, New York: 

Distinguished Visiting Professor, 1965. 

International School on Nonlinear Problems in Physics, Munchen: 

Lecturer, 1966. 

Universita di Pisa: 

Visiting Lecturer, 1966, 1973-1975, 1978, 1980, 1982, 1985, 1987. 

Sandia Corporation, Albuquerque, N.M.: 

Visitor, 1966. 

Drexel Institute of Technology, Philadelphia: 

Seventy-Fifth Anniversary Lecturer, 1966-1967. 

Accademia dei Lincei, Roma: 

Professore Linceo, 1970, 1973. 

Universidade Federal do Rio de Janiero: 

Lecturer for the Coordenacao dos Programas de Pos-Graduacao 
de Engenharia and the Instituto de Matematica, 1972. 

Georgia Institute of Technology, Atlanta: 

Consultant, 1973-1974. 

Scuola Normale Superiore, Pisa: 

Ospite Linceo, 1974. 

Brookhaven National Laboratories, Long Island: 

Consultant (Advanced Codes Review Committee, U.S. Nuclear Regulatory 
Commission), 1975-1983. 

University of Delaware: 

Bicentennial Scholar in Residence, 1976. 

Instituto de Ingenieria Mecanica y Mecanica Teorica y Applicada, 
Universidad Autonoma de Mexico, Mexico D.F.: 

Lecturer, 1977. 

Universita di Bologna: 

Professore Visitatore, 1978, 1987, 1988. 

Universite Catholique de Louvain: 

Visiting Professor, 1979. 
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Institut des Hautes Etudes Scientifiques, Bures-sur-Yvette: 

Visitor, 1981. 

Cornell University: 

First Distinguished Visiting Professor of Theoretical and Applied 
Mechanics, 1982. 

Universita di Firenze, Scuola di Architettura: 

Professore a Contratto, 1985. 

Scuola di Ingegneria Strutturale, Universita di Roma “La Sapienza”: 

Visiting Professor, 1990. 

Short Lecture Series and Named Single Lectures 

University of Toronto, 1949. 

Sorbonne, Paris, 1949, 1955. 

State University of Iowa, 1956. 

Indiana University, 1959. 

Scuola Internazionale di Fisica, Varenna, 1960. 

Universita di Padova, 1961. 

Universita e Politecnico di Milano, 1961. 

Midwest Mechanics Seminar Tour, 1962. 

Academy of Sciences, Warsaw, 1963, 1964. 

The Johns Hopkins University, 1965. 

Gibson Lecturer in the History of Mathematics, University of Glasgow, 1965. 
Distinguished Visiting Lecturer, Centennial of the University of Kentucky, 
1965. 

NSF Conference on Recent Developments in Continuum Mechanics, Virginia 
Polytechnic Institute, 1966, 1969. 

Koerner Lecturer, Simon Fraser University, Burnaby, BC, 1969. 
Distinguished Lecturer in Chemical Engineering, University of Rochester, 

1970. 

International Centre of Mechanical Sciences, Udine, 1971. 

Centennial Lecturer in Engineering Mechanics, Virgina Polytechnic Institute, 

1971. 

Section de Transferts Thermiques, Centre de Recherches Nucleates, 
Grenoble, 1973. 

Bajer Lecture, Princeton University, 1975. 

Durelli Lecture, Catholic University of America, 1977. 

International Symposium on Continuum Mechanics and Partial Differential 
Equations, Universidade do Rio de Janeiro, 1977. 

University of Chicago, 1979. 

Thermofluids Lectures, Departments of Chemical and Mechanical 
Engineering, School of Mines, University of Arizona, Tucson, 1980. 

Ritt Lectures, Department of Mathematics, Columbia University, 1982. 
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First MTU Lectures in Engineering Science, Michigan Technical University, 
Houghton, 1983. 

St. Andrews University, Scotland, 1983. 

Distinguished Scientist Lecture, Trinity University, San Antonio, TX, 1984. 
Allen Lecture in Mathematical Sciences, Rensselaer Polytechnic Institute, 
1985. 

Page-Barbour Lectures, University of Virginia, Charlottesville, 1985. 

Franklin Lecture, Auburn University, Auburn, AL, 1986. 

Invited Single Lectures to Meetings and Symposia 

American Mathematical Association (Baltimore, 1948). 

American Physical Society (Charlottesville, 1949; State College, PA, 1953; 
San Diego, CA, 1971). 

International Conference on Theoretical Fluid Mechanics, Harvard, 1950. 
Conference on Elasticity, University of Maryland, 1952. 

Sigma Xi (Indiana University, 1952; State University of Iowa, 1956; 

Illinois Institute of Technology, 1960; Georgia Institute of Technology 
(Monie A. Ferst Memorial Lecture), 1969; University of Tennessee, 1976; 
McGill University, 1976. 

Symposium on Ultrasonic Absoiption and Dispersion in Fluids, Brown 
University, 1952. 

Discussion Meeting on the Second Viscosity of Fluids, The Royal Society, 
London, 1953. 

First Midwestern Conference on Solid Mechanics, University of Illinois, 1953. 
Symposium of the Office of Ordnance Research and the American 
Mathematical Society, Chicago, 1954. 

American Society for Engineering Education, Urbana, 1954. 

Gesellschaft fur Angewandte Mathematik und Mechanik (General Lectures), 
Berlin, 1955; Hamburg, 1957. 

Sixth Conference on Hydraulics (General Lecture), Iowa City, 1955. 
Eulerfeier (Main Lecture), Basel, 1957. 

Washington Philosophical Society, 1958. 

Accademia Nazionale di Scienze, Lettere ed Arti (Inaugural Address), 
Modena, 1960. 

Celebrazioni Archimedee, Siracusa, 1961. 

I.U.T.A.M. Symposium on Second-order Effects in Elasticity, Plasticity, 
and Fluid Dynamics (General Lecture), Haifa, 1962. 

Fourth U.S. National Congress of Applied Mechanics (General Lecture), 
Berkeley, 1962. 

Summer Conference on Non-ideal Mechanical Behavior, Princeton, 1962. 
American Society of Mechanical Engineers, Washington, 1962. 

Symposium on Hemodynamics and Hydrodynamics, Baltimore, 1962. 
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International Congress of Rheology (Bingham Medal Address), Providence, 
1963. 

Society for Natural Philosophy at Pittsburgh, 1963; Notre Dame, 1971; 
Seattle, 1972; Pisa, 1974, 1978; Williamsburg (Fifteenth Anniversary 
Lecture), 1978; Rolla, 80; Brown, 1983; Baltimore, 1987; Pittsburgh 
(Walter Noll retirement symposium), 1993. 

Eleventh International Congress of Applied Mechanics, Munich, 1954. 

Philosophy of Science Seminal - , University of Delaware, 1965. 

Convegno dei Meccanici Italiana, Modena, 1966. 

I.U.T.A.M. Symposium on Irreversible Thermodynamics in Continuous 
Media, Vienna, 1966. 

Commemoration of Newton’s Annus Mirabilis, Austin, 1966. 

First Canadian National Congress of Applied Mechanics (General Lecture), 
Quebec, 1967. 

Third Buhl International Conference on Materials, Mellon Institute, 1968. 

Symposium on “The Interplay between Mathematics and Physics - The Rise 
of Mathematical Physics” at the University of Aarhus, 1970. 

Southwest Graduate Research Conference, Houston, Texas, 1971. 

Second Annual Meeting of the American Society for Eighteenth Century 
Studies, College Park, Maryland, 1971. 

Banquet address, meeting of the History of Science Society and Society for 
the History of Technology, Washington, 1972. 

Sectional address (History and Paedagogy), International Congress of 
Mathematicians, Vancouver, 1974. 

Address at the Engineering Commencement, Tulane University, New Orleans, 
1976. 

Address on receipt of a Birkhoff Prize, Annual meeting of the American 
Mathematical Society and the Society for Industrial and Applied 
Mathematics, 1978. 

Euromech Colloquium, Pisa, 1978. 

Italo-American Co-operative Science Seminar, Venice, 1978. 

Organizer’s address. Special Symposium on “Conceptual Analysis in Rational 
Thermomechanics”, Summer meeting of the American Mathematical 
Society, Providence, 1978. 

Keynote address on Constitutive Relations, E.P.R.I. Workshop on Two-Phase 
Flow, Tampa, Florida, 1979. 

Colloquium on Continuum Thermodynamics, Society of Engineering 
Science, Northwestern University, Evanston, 1979. 

Celebration of the 75th anniversary of Scientia, Milano, 1980. 

Plenary lecture, 8th International Congress of Rheology, Naples, 1980. 

General Lecture, Society of Engineering Science, Atlanta, 1980. 

Colloquium on the History of Mathematics, Winter meeting of the American 
Mathematical Society, San Francisco, 1981. 
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Keynote address, 11th Southeastern Conference on Theoretical and Applied 
Mechanics, Huntsville, Alabama, 1982. 

Joint Session on the History of Mathematics, meetings of the American 
Mathematical Society and American Mathematical Association, Toronto, 
1982. 

Festakt Daniel Bernoulli (Main Lecture), Basel, 1982. 

Leonardo e Beta della ragione (Congress organized by Scientia and the 
governing bodies of Milano and Lombardy), Milano, 1982. 

25th British Theoretical Mechanics Colloquium, Manchester, 1983. 

International Symposium, “The Codex Hammer in Context”, Walters Art 
Gallery, Baltimore, 1983. 

Workshop on the Laws and Structure of Continuum Thermomechanics, 
Institute for Mathematics and its Applications, University of Minnesota, 
Minneapolis, 1983. 

Convegno sul tenia “Termoelasticita finita”, Accademia Nazionale dei Lincei, 
Rome, 1985. 

International Conference on Nonlinear Mechanics, Shanghai, 1985. 

900th Anniversary celebrations. University of Bologna, 1987, 1988. 

300 Years of Gravitation, University of Cambridge, England, 1987. 

International Conference dedicated to the Tricentenary of the Publication of 
Newton’s Principia, U.S.S.R. Academy of Sciences, Moscow, 1987. 

First Plenary Lecture, 4th National Congress of Theoretical and Applied 
Mechanics, Coimbra, Portugal, 1987. 

Celebration of the 300th anniversary of Newton’s Principia, Technische 
Hochschule Darmstadt, 1988. 

First Plenary Lecture, III International Workshop on Mathematical Aspects of 
Fluid and Plasma Dynamics, Salice Terme, Italy, 1988. 

First Plenary Lecture, IX Congress Nazionale delTAssociazione Italiana di 
Meccanica Teorica ed Applicata, Bari, 1988. 

Imola Conference, Universita degli Studi di Bologna, September 5-7, 1988. 

Inaugural Charles E. Foster Lecture, School of Aerospace and Mechanical 
Engineering, University of Oklahoma, Norman, 1990. 

Convegno Internazionale “I Riccati e la cultura della Marca nel Settecento 
Europeo”, Castelfranco Veneto, 1990. 

First Rutgers Conference on Theoretical Mechanics: The Dynamics of Rods, 
August 24-27, 1990, Rutgers University, New Brunswick. 

Convegno Internazionale in Memoria di Vito Volterra, Accademia Nazionale 
dei Lincei, October 8-11, 1990. 

Editorial Positions 

Co-founder and Co-editor, Journal of Rational Mechanics and Analysis, 
1952-1956. 
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Editor or Co-editor, Leonhardi Euleri Opera Omnia, Series II, Vols. 10-13, 
18-19, 1952-1971. 

Co-editor, Handbuch der Physik, (Springer) Vols. 8/1, 8/II, 9 and 6a/l-6a/4, 
1956-1974. 

Founder and Editor, Archive for Rational Mechanics and Analysis, 1957— 
1967, Co-editor, 1967-1985; Editor, 1985-1989. 

Editor, Reihe fur Mechanik, Ergebnisse der Angewandten Mathematik, 1957— 
1962. 

Founder and Editor, Archive for History of Exact Sciences, I960-. 

Founder and Editor, Springer Tracts in Natural Philosophy, 1962-1966; 
Co-editor, 1967-1978; Editor, 1979-. 

Co-editor, Studies in the Foundations, Methodology and Philosophy of 
Science, 1966-1970. 

Member of the Editorial Board, Rendiconti del Circolo Matematico di 
Palermo, 197 1-. 

Member of the Editorial Board, Annali della Scuola Normale Superiore, Pisa, 
1974- 

Member of the Editorial Board, Meccanica, 197 4-. 

Member of the International Editorial Board, II Nuovo Cimento B, 1979- 
1981; II Nuovo Cimento D, 1982-1987. 

Member of the Editorial Council, Bollettino di Storia delle Scienza 
Matematiche, Unione Matematica Italiana, 1979-. 

Member of the Editorial Board, Speculations in Science and Technology, 
1980-1987. 

Member of the Editorial Board, Ganita-Bharati, 198 1-. 

Member of the Editorial Board, Stability and Applied Analysis of Continuous 
Media, 1991—. 

Organizational Positions 

U.S. Correspondent, International Mathematical News (Austria), 1952-1956. 

Member, Committee on Applied Mathematics, U.S. National Research 
Council, 1954-1956. 

Sponsor for Elasticity, American Society for Mechanical Engineers, 1956— 
1958. 

General Chairman, Conference on the Foundations of Mechanics and 
Thermodynamics, National Bureau of Standards, 1959. 

Member of organizing committee. International Conference on Rarefied Gas 
Dynamics, Berkeley, California, 1960. 

Member of organizing committee. International Congress of Logic and the 
Philosophy of Science, Stanford, 1960. 

Member of organizing committee, I.U.T.A.M. Conference on Second-order 
Effects in Elasticity, Plasticity, and Fluid Mechanics, Haifa, 1962. 




12 



B.D. COLEMAN 



Co-founder, Society for Natural Philosophy, 1963; Director, 1963-1984; 
Secretary, 1963-1965, 1970-1971, 1980-1981; Chairman, 1967-1968, 
1983-1984; 

Member of the Program Committee, 1975-1976. 

Co-chairman of the local committee and Chairman of the Round-Table 
Discussion, meetings of the Society for Natural Philosophy at Baltimore, 
1963, Bressanone, 1965, Chairman of a Round-table Discussion at the 
meetings at Chicago, 1966; Cincinnati, 1970; Cincinnati, 1977; Madison, 
1984. Co-chairman of the local committee for the meeting at Baltimore, 
1965. 

Coordinator, C.I.M.E. Course on Nonlinear Continuum Theories, 
Bressanone, 1965. 

Co-Chairman, First Joint Italian- American Cooperative Science Seminal - , 
Udine, 1971. 

Member of the Scientific Committee, Symposium on Problems of Plasticity, 
Polish Academy of Sciences, Warsaw, 1972. 

Co-Chairman, Italian-American Cooperative Science Seminal - , Udine, 1974. 

Organizer of Special Symposium “Conceptual Analysis in Rational 

Thermomechanics”, Summer Meeting, American Mathematical Society, 
Providence, 1978. 

Member of the Steering Committee, International Conference on Nonlinear 
Mechanics, Shanghai, 1985. 

Honorary Doctorates 

Dott.ing.h.c. in Mechanical Engineering (Fluid Mechanics and History of 
Science), Centenary of the Politecnico di Milano, 1965. 

D.Sc. (Engineering), Tulane University, 1976. 
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In many ways, the research performed by Clifford Truesdell on the history of 
mathematics can be summarized by the title of the first article, at the beginning of 
the first issue of the Archive for the History of Exact Sciences : “A program towards 
rediscovering Rational Mechanics in the age of reason”. Two themes come together 
and will always recur in Truesdell’s research. The first one is reason: in an age when 
the term “enlightenment” took up also negative meanings, Truesdell never stopped 
claiming, in a decisive and clear language, the supremacy of reason as the only 
guide to human behavior. He saw the end of this “age of reason” in the French 
revolution, the source, in his opinion, of all atrocities of modern times, from the 
lager to the gulag, from the nuclear threat to universal suffrage. Truesdell sees all 
subsequent historical events in an exclusively negative way. Although we cannot 
understand or agree with Truesdell on all these, we see that his theory envisages 
reason as the only compass able to guide mankind in his daily choices, and dis- 
cover in his mathematical philosophy a model of that rational culture that seems 
sometimes increasingly far from us and obsolete. 

The second theme is rational mechanics. All of the work of Truesdell on history 
places at the center the onset and the development of modern rational mechanics, 
which is a well-known discipline for Italian scientists, but is somehow unrelated 
to the Anglo-Saxon tradition. Rational mechanics is considered by Truesdell, first 
of all, as an out and out mathematical discipline, equally far from pure abstract 
speculation and from “big science” with its big-scale projects and few ideas. 

Not only private, individual experimental researches were performed in the eigh- 
teenth century; there were also large, cooperative projects. As today, they cost more 
than real science, and they attracted administrators. But the effect of all this expense 
on what we now consider the achievement of the period was nil. The method used in 
the great researches was entirely mathematical, but the result was not what would now 
be called pure mathematics. Experience was the guide; experience, physical experi- 
ence and the experience of accumulated previous theory. If we were to seek a word 
for what was done, it would not be physics and it would not be pure mathematics; 
least of all would it be applied mathematics: It would be rational mechanics.* 

* Essays in the History of Mechanics, Springer- Verlag, Berlin, 1968, p. 136. 
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In rational mechanics, Truesdell has studied especially the founders, “the im- 
mortal ones” in his personal definition: Euler, Jacob, Johann and Daniel Bernoulli, 
Cauchy. This choice is evidently a reflection of his personal orientations and taste; 
there arc, however, deeper reasons than his personal preferences: these reasons 
originate from the way Truesdell conceived the study of the historical aspects of 
mathematics. 

One of the key points of Truesdell's perspective on the history of mathematics 
concerns competence. Truesdell did not indulge in sociological investigations or 
in the description of cultural circles and cultural institutions, except for the cases 
where those aspects had a decisive function for the development of the research. 
His history was about contents rather than circumstances. According to Truesdell, 
the history of science has reason to exist only if it tells about science; otherwise 
it will just exert a bad influence both on history and, especially, on science itself. 
For this branch of history, one must have a detailed, deep and diverse knowledge: 
in the first place, knowledge of mathematics, based on an extensive study of the 
subject, preferably associated with some kind of familiarity with research proce- 
dures, and then historical and philological competence. All this knowledge can 
only be acquired after a long study period: the history (or the philosophy) of math- 
ematics cannot be the occupation of retired mathematicians or literates without any 
scientific background: 

The philosophy of science, I believe, should not be the preserve of senile scientists 
and of teachers of philosophy who have themselves never so much as understood the 
contents of a textbook of theoretical physics, let alone done a bit of mathematical 
research or even enjoyed the confidence of a creating scientist.* 

Since it is not a matter of describing an environment that looks the same to each 
observer, but rather a matter of analyzing and understanding different contributions 
hearing the most diverse degrees of deepness and quality, the specificity and the 
vastness of the history of mathematics raise the urgent problem of relevance. As he 
cannot describe and reproduce all existing written treatises and documentation on 
mathematics, the historian must make a choice and illustrate, at the same time, the 
criteria on which he bases his choice. Truesdell has only one criterion: relevance. 
First of all, after examining the huge series of authors and literature, the historian 
will have to identify who and what gave a concrete contribution to the development 
of the discipline, then comprehend and describe the contents. This absolutely re- 
quires a specific competence; from this comes, furthermore, the need to evaluate, 
express judgements and to take a position. 

In writings on the history of science today, as in all aspects of social intercourse, 
it becomes increasingly bad taste to call a spade a spade. In the particular appli- 
cation to the history of science, the compulsion to euphemy assumes the form of 
a solemn refusal to admit that there is such a thing as wrong in science. All that 
matters is how the scientists of one epoch thought and felt about nature, their own 

* An Idiot's Fugitive Essays on Science, Springer- Verlag, New York, 1984. p. vii. 
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work, and the work of others. In particular, the beginner is enjoined above all not 
to take sides, like the positivist historians of the last century, and set past science 
into categories of true and false according as it does or does not agree with what is 
taught in school science of our own day. However admirable this philosophy may 
be in promoting peace and mutual love among historians of science, it disregards 
one aspect of science that is not altogether negligible, namely, that scientists seek the 
truth, not a truth. He who refuses to “take sides” in science in effect negates science 
itself by denying its one and common purpose. He reduces science to just one more 
social manifestation. In so doing, not only does he by implication belittle the great 
scientists of the past, but also he sins against history, for in his attempt at historical 
impartiality he destroys the object, namely science, the history of which he claims 
to write.* 

From this point of view, the mathematician, and consequently also the historian 
of mathematics, derives an undoubted advantage from the fact that mathematics, 
as opposed to other disciplines, applies objective criteria to distinguish truth from 
wrong and relevant aspects from secondary ones: 

Now a mathematician has a matchless advantage over general scientists, historians, 
politicians, and exponents of other professions: He can be wrong. A fortiori, he can 
also be right. There are errors in Euclid, and, to within a set certainly of measure 
zero on the ordinary human scale, what Euclid proved to be true in ancient Greece is 
true even in the colossal, unprecedented, nucleospatial, totally welfared today. In the 
advance through the physical, social, historical, and other sciences, the demarcation 
between truth and falsehood grows vaguer, until in some areas truth can be rezoned as 
falsehood and falsehood enshrined into truth by consensus of “acknowledged experts 
and authorities” or even popular vote.** 

According to Truesdell, to take a position is a duty, perhaps the main duty of a 
historian, who must distinguish relevance from fame and put each author in the 
right perspective. In his treatises about a period or an author, Truesdell never 
neglects to take a clear position and to go so far that, in some cases, it looks 
as if he intentionally chooses topics which allow him to take a position against 
established interpretations and evaluations, often accepted and repeated without 
criticism. His heroes, as we explained, are Euler, Bernoulli, Cauchy and, in history, 
Pierre Duhem. On the opposite side, he puts D’Alembert, Lagrange and Mach: the 
last two arc considered responsible, by the way, for the misunderstandings and the 
forgeries present in the history of mechanics. 

The first volume [of the Encyclopedic ] carries as frontispiece a magnificent engraving 
of D’Alembert, who, under the guise of authoritative reviews, filled its pages with 
shoddy hashes of antiquated science served up with a sauce of his own prejudices, 
advertisements for his researches, and attacks on his opponents.^ 

* Essays in the History of Mechanics, pp. 145-146. 

** Essays in the History of Mechanics, p. 140. 

^ The rational mechanics of flexible or elastic bodies 1638-1788 (Leonhardi Euleri Opera 
Omnia II, XI (2)), p. 245n. 
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Lagrange’s talent for algebra was undoubtedly great, but in respect to fundamental 
questions of analysis or mechanics his work does not attain the logical and concep- 
tual standards of his great predecessors. Also, the proportion of nontrivial errors in 
Lagrange’s calculation is high compared with other major mathematicians’. This 
body of errors seems to have attracted little notice, so that Lagrange is generally 
given credit for having solved several problems on which his work is largely or totally 
wrong. 

It is time for a reappraisal of the work of the French mathematicians, a reappraisal 
constructed, in defiance of the generalites from the obituaries and the descendents of 
the obituaries, upon critical study of the work done. I am confident such a reappraisal 
would much reduce the importance of D’Alembert and Lagrange, would yield 
a more realistic view of Laplace, Monge, and Fourier, would raise Clairaut 
and Poisson to their just level, and would reveal Cauchy as the towering giant of 
his age and nation.* 

While Lagrange’s book is a good starting place, experience with it has led me 
to the following working hypotheses: 

1. There was little new in the Mechanique Analitique; its contents derive from 
earlier papers of LAGRANGE himself or from works of Euler and other pre- 
decessors. 

2. General principles or concepts of mechanics are misunderstood or neglected by 
Lagrange. 

3. Lagrange’s histories usually give the right references but misrepresent or slight 
the contents.** 

Not even Leonardo escapes his criticism. Commenting on the attribution to 
Leonardo of the laws ruling the free fall on inclined planes, Truesdell writes: 

From many other examples we know that LEONARDO often proposed rules of simple 
proportion, and that usually they came out wrong. Here is one that is right. Did 
Leonardo know it? Was not this, for him, just one of the hundred linear rules he 
proposed and never got around to trying out, the only difference being that this one 
concerns a problem later seen to have central importance, and this answer just turned 
out to be right? 

I fear so. We remember that in regard to free fall, Leonardo proposed several 
linear laws, mutually contradictory, and in the single one that turned out to be correct, 
he may simply have forgotten to repeat his other, incorrect, statements. 

The facts before us are simple: 

1 . In physics, some relations are linear and some are not. 

2. Leonardo never proposed any relation other than a linear one. 

3. Leonardo did propose dozens of linear relations. 

4. From 1 and 3, some of Leonardo’s rules may be expected to come out right. 

Counting only the cases when he was right and disregarding those when he was wrong 
may give a somewhat distorted estimate of his capacity as a natural philosopher. ^ 

* The rational mechanics of flexible or elastic bodies, p. 412n. 

** Essays in the History of Mechanics, pp. 246-247. 

% Essays in the History of Mechanics, p. 36. 
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Once again, he does not fear the wrath of defenders of the “politically correct” 
principle, when he writes an article hearing the title “ Sophie Germain: fame earned 
by stubborn error”, in which he makes statements such as: 

The above suffices to show that Sophie Germain was ignorant not only of elementary 
mechanics but also of the calculus of double integrals.* 

Of course, Truesdell does not blame the authors; they did all that they possibly 
could. Instead, he blames the historians, who arc not able to distinguish between 
sumptuously dressed banalities and conclusive results, who keep repeating already 
repeated statements without thinking them through and, by doing so, who con- 
tribute to the onset of an historic treacle, in which nobody anymore is able to 
distinguish a valid theorem from a futile exercise. 

While a physicist writing on the history of physics usually tells us what he thinks the 
old scientists must have thought, and a historian tells us whom they knew and what 
books they read. Professor JAMMER tells us mainly what they said they did .... 

A history apparently was not intended, since, despite critical remarks here and there. 
Professor JAMMER seems to be content with quiet juxtaposition of conflicting opin- 
ions. We read what was thought about mass not only by Newton and Hertz but 
also by Alois Hofler and Clementich de Engelmeyer. Now if Professor 
Jammer had found that Hofler and de Engelmeyer, although forgotten today, 
in fact did something important in mechanics, everyone should congratulate him on 
his success as a historian, but when he merely tells us what they thought - Hofler 
is quoted as saying, “The tonomonic quantity ‘dyne’ precedes logically the notion of 
‘one gram mass’ ”, and de Engelmeyer “our daily experience prepares us much 
better for the comprehension of the notion of force than of mass ...” - then we may 
well ask, who cares? Indeed, if it had been Huygens and Euler who had made the 
statement just quoted, a historian would do neither their memories nor his readers 
any service by perpetuating these flat vacuities.** 

Once the history of our culture was our common heritage, our pride and our lesson 
book for conduct both private and public. . . . For me, as long as 1 have tried to do 
research, the one and only school of method has been study, study and study of the 
masters. 

Today these simple truths are as obsolete currency as gold coin. . . . The beginner 
in history of science must be taught first of all what will make him, if he com- 
pletes apprenticeship, different from and independent of historians and scientists 
alike. Mathematics cannot be defined now except as that which mathematicians do; 
for physics, we substitute the word “physicists”, and soon the history of science will 
be defined as that which historians of science do and will likewise live a Parkinsonian 
life, independent equally of science and of history. Just as books on political history 
are written now to be read by political historians alone, and works on mathematics 
to be read by none but professional mathematicians, soon we can expect that books 
on the history of science will be meaningless except to historians-of-science, dumb 

* Bollettino di Storia delle Scienze Matematiche, 11-2 (1991), p. 12. 

** An Idiot’s Fugitive Essays on Science, pp. 170-171. 
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to scientists and to historians, serving only to produce more and more historians- 
of-science who are paid, if they can get jobs, to do nothing but indoctrinate more 
historians-of-science.* 

The history of science is different in kind from science itself and from ordinary his- 
tory. The material of the history of science is compact: Being history, it necessarily 
concerns the past, and because in the past science was a tiny and select vocation, not 
the factory job it is today, there is little to be read. What little there is, includes the 
highest intellectual achievement of our culture as well as a part of its finest artistic 
creation. . . . The example set today by the professions of scientists and historians 
is the worst that historians of science could choose to follow. Indeed, the history 
of science needs to be cleared and established. Thereafter, it ought to be learned. 
Although only a handful of persons could ever acquire the eccentric conjunction of 
skills and knowledge necessary in him who would do sound research in the history of 
science, there are many who can and should learn the results of that research. History 
of science should be studied and learned by every scientist, every historian, every 
person who seeks any intellectual footing in the Western culture. The great need of 
history of science today is for teachers.** 

These last two citations bring us to another topic on which Truesdell repeatedly 
insisted, in particular in his Lectio doctoralis , held on the occasion of the confer- 
ment of the “laurea honoris causa” at the University of Ferrara: the importance of 
the history of mathematics and, more generally, of the history of science, not only 
for the culture of the average citizen and, in particular, for the culture of scientists, 
but also for the interaction between historic and scientific culture. Without the 
history of science, a scientist’s culture is an end in itself; without at least a basic 
knowledge of science, the history of mathematics is impossible. 

All attempts to write a history of mathematics to be read by non-mathematicians 
turned out as disastrous failures: horrendous and stupid myths, the caricature or even 
the degradation of mathematics. Usually, the mathematician is represented as a sort 
of quack, rather than the way a great mathematician really is: a thinker, an organiser 
of ideas, a creator of new concepts, the bearer of the truth, the indefatigable worker 
in the fields of knowledge and beauty. In such works of vulgarization, mathematics 
itself is described as an arcane science, a spell, a crazy revelation. 

My discussion here refers not only to mathematics in its modern and specific sense, 
but also to mathematical science. The history of mathematical science, although 
probably not very useful to individuals who are ignorant of any form of mathematics, 
should be understandable to a much vaster public nowadays, more than any common 
research paper. It should be interesting, perhaps even useful, to anybody with some 
knowledge of mathematics, not necessarily very deep and detailed, and to anybody 
with some conscience of the value of mathematical science. A treatise on the history 
of mathematical science should differ profoundly from a modern research essay. * 

* An Idiot’s Fugitive Essays on Science, pp. 585-586. 

** An Idiot’s Fugitive Essays on Science, p. 589. 

t Translated from Honoris Causa. Lezioni dottorali di insigniti di laurea ad honorem in occasione 
del VI centenario dell’Ateneo. Academic Year 1991/1992. University of Ferrara, 1995, p. 49. 
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One of the main functions it should fulfill is to help scientists understand some aspects 
of specific areas of mathematics about which they still don’t fully know. What’s more 
important, it helps them too. By satisfying their natural curiosity, typically present in 
everybody towards his or her own forefathers, it helps them indeed to get acquainted 
with their ancestors in spirit. As a consequence, they become able to put their efforts 
into perspective and, in the end, also able to give those efforts a more complete 
meaning. By seeing beyond one or two leaves of the great tree of mathematics, they 
can comprehend the whole structure, discern the subdivision of its branches and trace 
back its roots. 

When a scientist gets the chance to measure his own efforts by comparing them with 
the results of the immortals, he will feel perhaps less compelled to vie for supremacy 
over the pygmies of his level and will try, instead, to achieve one or two small goals, 
which will have a probability to survive the test of time. This is a moral advantage,* 

Mathematical science offers examples of what human reason can achieve, of the 
safety that comes together with reason; by reaching a rational clearness, they put also 
in evidence the limits of reason and the unstable foundations on which all sciences 
are based. The methods that turned out to be successful and those that have been 
surpassed and failed, the paths that guided to the final destination and the blind ones, 
can be learned exclusively through the history of mathematical sciences. This is an 
old-style history, the history of men and of their actions; with the words of Savile: 

No other study is as suitable as history to guide the life of mankind. 

This is a social advantage.** 

Still, relevance and fame are two different concepts. Truesdell does not choose 
his heroes simply among the recognized mathematicians. The scientists who are 
brought by Truesdell to the level of heroes, and take, in this way, a curtain call 
on the scene of Truesdell’s investigations, are sometimes practically unknown, not 
only to the general public, but also to historians of science. 

Particularly enlightening examples arc John Herapath and John James Water- 
ston. Both of them were the first ones to give a substantial contribution to the kinetic 
theory of gases, after the publication of Daniel Bernoulli’s Hydrodynamica : both 
of them had sent their work to the Royal Society which, in both cases, rejected it, 
giving rise to a series of disputes. Herapath, though he used the momentum instead 
of the kinetic energy of particles for his definition of temperature, had shown that 
the kinetic theory was suitable as a first explanation for phase transitions, diffu- 
sion, and the propagation of sound. In a series of letters, in which Herapath was 
exhorted to abandon these speculative aspects and devote himself to experiments, 
the president of the Royal Society, Sir Humphry Davy, wrote, “having considered 
a good deal the subject of the supposed real zero, I have never been satisfied with 
any conclusions respecting it. I cannot see any necessary connexion between the 



* Translated from Honoris Causa, pp. 49-50. 

** Translated from Honoris Causa, p. 50. 
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capacity of bodies for heat, and the absolute quantity they contain; and temperature 
does not measure a quantity, but merely a property of heat.” 

As far as Waterston is concerned, long before the work of Joule, Thompson and 
Kit) nig, he had sent in 1845 a note on kinetic theory to the Royal Society. This 
document was rejected by the Society as well. On that occasion, one of the two 
referees objected that the principle that pressure depended on molecular collisions 
with the walls, a central hypothesis of Waterston’s document, was “by no means 
a satisfactory basis for a mathematical theory”. The second referee just wrote that 
“the paper is nothing but nonsense, unfit even for reading before the Society”. The 
work in question was rediscovered by Lord Rayleigh in 1891 and published in 
1893, ten years after Waterston’s death. 

In the contrast between the genius and the Academy, Truesdell chooses def- 
initely the first and draws two important conclusions from the above-described 
events. In the first place, he praises an anarchical approach in academic research 
and organization. Truesdell himself depicts the academic world with his incompa- 
rable style, as somewhat serious and somewhat ironical, and describes it in ways 
that, besides some details, could very well fit the Italian situation: 

Our academic life presents to the foreigner a lamentable scene of chaos. No-one 
knows who is on top. If in University 1 Professor A is a demigod, we have only to 
consult Professor B in University 2 to learn that in his department A would not qualify 
even as an assistant. True, A belongs to six national committees, has a million dollar 
grant from the Central Spy Bureau, and has published eight successful textbooks, 
but B , who points to A’s textbooks as models of nonsense, has written 216 research 
papers with twenty-three co-authors and also is consultant for four major corpora- 
tions, assistant editor of five journals and second vice-president of a professional 
trade-union.* 

The second conclusion drawn by Truesdell from the events in which Herapath 
and Waterston had been involved is more important. He thought that a system mak- 
ing use of anonymous referees favored a reactionary attitude and made it difficult 
for new ideas to emerge. In the journals he founded, the two Archives , this system 
was replaced by one that the name of the communicator of a paper was published 
immediately beneath the name of the work’s author. He thought that this change 
would successfully replace anonymous irresponsibility with individual and clear 
responsibility. Unfortunately, I fear that his thought was wrong: in the framework 
of modern organization, the problem is not anymore the protection of a genius from 
the closed and reactionary attitude of the establishment, but rather the protection of 
journals from the huge amount of irrelevant material which threatens to flood them. 
From this point of view, a comparison between different opinions is preferable to 
leaving it all to the whim of just one communicator. After all, no system is perfect. 



An Idiot’s Fugitive Essays on Science, p. 399. 
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Clifford Truesdell was a singularity among all prominent scientist-scholars of the 
twentieth century. He believed that the pinnacle of civilization had been reached 
in the 18th century and that things have gone downhill ever since. He had no 
television set and no radio, and I doubt that he ever used a typewriter, let alone a 
computer. Many of the letters he sent me were written with a quill pen. (However, 
he did not reject such modern conveniences as flush toilets and air-conditioning.) 
He loved baroque music and did not care very much for what was composed later 
on. He owned very fine harpsichords and often invited masters of the instrument 
to play, often before a large audience, in his large home in Baltimore, which he 
called the “Palazzetto”. He collected art and antique furniture, mostly from the 18th 
century. He even often dressed in the manner of an 18th century gentleman. Most 
importantly, he admired the scientists of the 18th century and, above all, Leonhard 
Euler, whom he considered to be the greatest mathematician of all time. Actually, 
in the 18th century, mathematics and physics were not the separate specialities 
they arc today, and the term “Natural Philosophy” was the term then used for the 
endeavor to understand nature by using mathematics as a conceptual tool. C.T. tried 
and succeeded to some extent in reviving this term. 

Clifford Truesdell was extremely prolific and he worked very hard most of the 
time. When he didn’t, he liked to eat well and drink good wine. This is perhaps 
one reason why he loved Italy so much, and spent extended periods of time there. 
Clifford Truesdell was not only an eminent scientist but also a superb scholar. 
He mastered Latin perfectly. He could not only read and understand the classical 
literature, which was written in Latin, but even wrote at least one paper in Latin. 
He was fluent not only in his native language, English, but also in Lrench, German, 
and, above all, Italian. He wrote papers and gave lectures in all of these languages. 
All of these attributes are particularly rare for somebody born in Los Angeles, 
California. 



* This paper is the text, with minor revisions, of a lecture given by the author at the Meeting in 
memory of Clifford Truesdell , held in Pisa, Italy, in November 2000. 
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Clifford Truesdell was only in his early 30s in the early 1950s when he had 
already established himself as perhaps the world’s best informed person in the field 
of continuum mechanics, which included not only the classical theories of fluid 
mechanics and elasticity but also newer attempts to mathematically describe the 
mechanical behavior of materials for which the classical theories were inadequate. 
In 1952, he published a 175 page account entitled “The Mechanical Foundations 
of Elasticity and Fluid Dynamics” in the first volume of the Journal of Rational 
Mechanics and Analysis. 

Springer- Verlag, with Headquarters in Heidelberg, Germany, has been an im- 
portant publisher of scientific literature since the beginning of the 20th century. In 
the 1920s it published the Handbuch der Physik, a multi-volume encyclopedia of 
all the then known knowledge in Physics, mostly written in German. By the 1950s 
the Handbuch had become hopelessly obsolete, and Springer- Verlag decided to 
publish a new version, written mostly in English and called “The Encyclopedia 
of Physics”. Here is Clifford Truesdell’s own account, in a 1988 editorial in the 
Archive for Rational Mechanics and Analysis, of how he became involved in this 
enterprise: 

In 1952 Siegfried Flugge and his wife Charlotte were organizing and carrying through 
the press of Springer- Verlag the vast, new Encyclopedia of Physics (Handbuch der 
Physik), an undertaking that was to last for decades. Joseph Meixner in that year 
had called Fliigge’s attention to my article The mechanical foundations of elasticity 
and fluid dynamics . . . On October 1, 1952, Flugge invited me to compose for the 
Encyclopedia a more extensive article along the same lines but presenting also the 
elementary aspects of continuum mechanics, which my paper . . . had presumed to be 
already known by the reader. In the second week of June 1953, Flugge spent some 
days with us in Bloomington, mainly discussing problems of getting suitable articles 
on fluid dynamics for the Encyclopedia. ... He asked me to advise him. and by the 
end of his visit I had agreed to become co-editor of the two volumes under discussion. 

It was C.T., in his capacity as co-editor, who invited James Serrin to write the 
basic article on Fluid Mechanics. C.T. himself decided to make two contributions 
dealing with the foundations of continuum mechanics, the first to be called “The 
Classical Field Theories of Mechanics” and the second “The Nonlinear Field The- 
ories of Mechanics”. Here is what he wrote about this decision, in the preface to a 
1962 reprint of his Mechanical Foundations: 

By September, 1953, 1 had agreed to write, in collaboration with others, a new ex- 
position for Fliigge’s Encyclopedia of Physics. It was to include everything in The 
Mechanical Foundations, supplemented by fuller development of the field equations 
and their properties in general; emphasis was to be put on principles of invariance and 
their representations; but the plan was much the same. As the work went into 1954, 
the researches of Rivlin and Ericksen, and of Noll made the underlying division into 
fluid and elastic phenomena, indicated by the very title The Mechanical Foundations 
of Elasticity and Fluid Dynamics no longer a natural one. It was decided to split the 
projected article in two parts, one on the general principles, and one on constitutive 
equations. The former part, with the collaboration of Mr. Toupin and Mr. Ericksen, 
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was completed and published in 1960 .... In over 600 pages it gives the full material 
sketches in Chapters II and III of The Mechanical Foundations. The second article, to 
be called “The Nonlinear Field Theories of Mechanics”, Mr. Noll and I are presently 
engaged in writing. The rush of fine new work has twice caused us to change our basic 
plan and rewrite almost everything. The volume of grand and enlightening discoveries 
since 1955 has made us set aside the attempt at complete exposition of older things. 
It seems better now to let The Mechanical Foundations stand as final for most of the 
material included in it, and to regard the new article not only as beginning from a 
deeper and sounder basis than could have been reached in 1949 but also as leaving 
behind it certain types of investigation that no longer seem fruitful, no matter what 
help they may have provided when new. 

In 1951 I started employment as a “Wissenschaftlicher Assistant” (scientific as- 
sistant) at an Institute of Engineering Mechanics (Lehrstuhl fur Technische Mecha- 
nik) at the Technical University of Berlin. In late 1952 I came across a leaflet from 
Indiana University offering “research assistantships” at the Graduate Institute of 
Mathematics and Mechanics. I applied and was accepted to start there in the Fall 
of 1953. After arriving, I found out that I had become a “graduate student”. At 
first I did not know what that meant, because German universities have nothing 
corresponding to it. But the situation was better than I expected, because no work 
was assigned to me, and I could pursue work towards a Ph.D. degree full time. 

I asked Clifford Truesdell to be my thesis advisor, because he was the local 
expert on mechanics and I thought that would help me after my return to Germany. 
At the time I did not know that C.T. had already become the global expert on 
mechanics. I also did not know at the time that I was C.T.’s first doctoral student. 
I was told later that other graduate students did not select him as a thesis advisor 
because they thought that he was too tough. It is true that he was tough and asked 
much of me, but he was also extremely kind and helpful, and, in the end, he had 
more influence on my professional career than any other person. I learned from 
him not only the basic principles and the important open questions of mechanics, 
but also how to write clear English. In addition, his wife Charlotte and he did very 
much to make me feel comfortable in Bloomington. They invited me to their home 
very frequently to share the gourmet dinners prepared by Charlotte. As a thesis 
topic, C.T. gave me a paper written by S. Zaremba in 1903 and asked me to make 
sense of it. I found that, in order to do so, a general principle was needed, which 
I called “The Principle of Isotropy of Space”. This principle also served to clarify 
many assertions in mechanics that had been obscure before (at least to me). C.T. 
had a wonderful sense of humor. One day he put up his son, then about 10 years 
old, to ask me: “Mr. Noll, please explain the Principle of Isotropy of Space to me.” 
Actually, I think I have found a good explanation only recently, about 40 years later. 
I had a very good background in the concepts of coordinate-free linear algebra and 
used it in my thesis. At the time, C.T. was uncomfortable with this approach and 
forced me to add a coordinate version to all formulas. However, he did not have a 
closed mind and later, in a letter to me dated August 4, 1958, he wrote: 
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I must also admit that the direct notations you use are better suited to fundamental 
questions than are indicial notations. Your present mathematical style is smoother 
and simpler than that in your thesis. 

After I received my Ph.D. in September 1954 and returned to Germany, C.T. and 
I stayed in contact by mail, and I saw him again when he gave a lecture in Berlin in 
June 1955 (“Das ungeloste Hauptproblem der endlichen Elastizitatstheorie”). The 
term of my position in Berlin ran out in the Fall of 1955 and I needed a new job. 
On C.T.’s recommendation I was offered an Associate Professorship at Carnegie 
Institute of Technology (now Carnegie Mellon University), where I moved in the 
Fall of 1956 after having spent a year at the University of Southern California. Here 
is an excerpt from a letter he wrote to me on February 14, 1956, after I accepted 
the position at Carnegie Tech: 

I think you deserve your position at Carnegie and hope you will like it. If I might 
offer a word of advice, it would be to maintain the level set by your two papers 
in the Journal. Nowadays so many people are publishing like rabbits that volumes 
of papers make little impression. Many young people publish too much when they 
find out how easy it is to do something others have not done, especially something a 
senior colleague is too dull to see. It is of course necessary to avoid setting such high 
standards that one’s publication ceases entirely, as is the case with many well known 
savants. 

Eater, in December of the same year, he wrote me the following letter: 

Dear Noll, 

Ericksen and I have long promised to write a second article, The nonlinear field 
theories of mechanics, to cover exact work on elastic, fluid, and plastic phenomena. 
Ericksen is far behind on the first article, and therefore we have not even outlined this 
second one. Although I have not yet consulted Ericksen, I feel sure that he would be 
relieved if you would consent to join us and do a major part of the second article. 
The order of authors would be according to the amount of work provided, and if you 
could do most of the work, you would be the senior author. 

You know both the old Mechanical Foundations and the new Classical Field 
Theories thoroughly. Therefore you have as much background as anyone else, and 
you have made important contributions yourself. I have not been able to think of a 
good organization. All I can say is that I should prefer to cut down on the amount 
of space given to special proposals prior to 1948 and to emphasize general work such 
as you and Rivlin have done. Whether it is better to treat very general visco-elastic 
theories first or to give a definitive presentation of classical finite elasticity first is not 
clear to me. The article should be exhaustive, but for the work prior tol948 I think 
we need only condense the Mechanical Foundations or change the emphasis, as I 
believe the list of sources cited there is virtually complete. The presentation should 
be concise, but the length is not critical. 

If this proposal is acceptable to you, I will put it up to Ericksen at once. Also, 
if you could start right away on the organization and suggest a distribution of material 
among the three authors it would help. I believe I can begin active work within two 
months, but I do not expect to be able to prosecute exclusively this one project in the 
near future. 
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Here is an excerpt from my answer, written on January 2, 1957: 

I shall be glad to accept your proposal and join you and Ericksen as co-author of 
“the nonlinear field theories of mechanics”. It will be a challenging task for me, but I 
shall do the best I can. I have thought a little bit about the organization. Here is very 
roughly what I would like to propose: 

The first section should contain general remarks concerning constitutive equa- 
tions, their classification, invariance requirements, constraints, definitions of terms 
like “material”, “isotropic”, “aelotropic”, etc. 1 have been thinking about this subject 
for quite a while, and I plan to include my ideas on it in a paper on the axiomatic 
foundation of mechanics, which I plan to write in the near future. 

I think it is best to give then an account of finite elasticity and nonlinear fluid 
theory (Reiner-Rivlin). These are the simplest and best known nonlinear theories. 
Next, more general stress-strain relations (as those proposed by Rivlin and Erick- 
sen) could be treated. Finally, constitutive equations involving stress rates (as those 
proposed by Oldroyd, Cotter and Rivlin, and in my thesis) and their special cases 
(hypo-elasticity) could be worked out, and the connection of these general theories 
with the simpler ones could be established. 

I would be willing to do the general remarks and also most of the treatment of 
the various constitutive equations in general. However, I would appreciate some help 
with the special cases and solutions. I would think, for instance, that Ericksen would 
have little trouble to present all special solutions valid for an arbitrary strain energy 
function in finite elasticity by taking his two papers (TAMP 5: 466-489 and J. Math. 
Phys. 2: 126-128) as a basis. Also, it is probably easier for one of you to deal with 
finite deformations of shells, anisotropic elastic bodies, and similar material. I would 
think, too, that you would want to do the section on hypo-elasticity yourself. 

I am open to alternative and more detailed proposals concerning the organiza- 
tion as well as the distribution of the material to the authors, and I am looking forward 
to have your opinion. 

The new task of working on the Nonlinear Field Theories of Mechanics (NLFT) 
changed my professional life forever, because it focused my attention on the foun- 
dations of continuum mechanics. C.T. was very good at reading, digesting, and 
summarizing existing literature. I am not. I cannot help rethinking and recreating 
whatever subject I wish to understand. This had perhaps the effect of improving 
the NLFT in the end, but it also slowed down the progress towards completion. 
Sometimes C.T. became very impatient with me, and with good reason. The low 
point was reached when C.T. wrote the following letter to me: 

Dear Walter, 

Upon my return from the West, I did not find any manuscript of our article. 
As I said in Pittsburgh, I feel now that you should withdraw from the article. This 
is the third period of half a year or more in which you have not sent me a line, and 
again the subject is slipping away from us. My having had to disturb you every few 
weeks or days for the past six months so as to get your assurance that a large section 
of manuscript would be ready in a few days has been very painful, the more so since 
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fruitless. I have put in this treatise the better part of all my work in the past five years, 
and as I prepare for another trip to Europe, I cannot drag this weight of responsibility 
any longer. 

I plan to finish the article by intensive work in January and February, and I 
request you to send me all material I gave you before I went to Europe last spring. 

Surely, you know that I feel this loss deeply. The parts written by you are 
far better than anything I could write, and your criticism and correction of the parts 
I wrote have been of the highest value. There is no one else who could do the job 
you are capable of doing, but you have not done it, and some sort of article must be 
published nevertheless. There is too much invested in it already. 

Well, I pulled myself together and I helped to finish NLFT. 

During our collaboration we were in constant contact by mail or in person. 
I visited Bloomington in November 1958. C.T. came to Pittsburgh in June 1959 and 
gave lectures at the Mellon Institute (later a part of Carnegie Mellon University). In 
August 1961, C.T. left Indiana University and he and Charlotte moved to Baltimore 
where he joined The Johns Hopkins University with the title Professor of Rational 
Mechanics. In September 1961, C.T. visited Pittsburgh again. During the academic 
year 1962/63, C.T. arranged for me to come to Johns Hopkins University as a 
visiting professor. 

Occasionally, C.T. and I had disagreements on terminology. Perhaps the most 
interesting concerned what we finally called “The Principle of Material Frame- 
Indifference”. Here arc some excerpts from letters that we exchanged on this sub- 
ject: 

C.T. to W.N., August 1, 1958: 

I was just planning to write something mentioning your old “isotropy of space”. It 
seems to me the term “principle of objectivity” shares one of the undesirable features 
of the old term, namely, it is too vague. I was going to use “the principle of material 
indifference”. However, it would be bad for there to be three names running around 
for the same principle. If you find my suggestion appealing, let me know right away; 
otherwise, I will use your present name. 

W.N. to C.T., August 5, 1958: 

I have switched from “isotropy of space” to “objectivity” because the former suggests 
only that there are no preferred directions in static space, which is much less than is 
implied by the principle of objectivity. 

If “objectivity” means independence of the observer, as I believe it does, then 
it seems to me that it is the correct term. One could remove some of the vagueness 
you mentioned by using “principle of objectivity of material properties” (or perhaps, 
“principle of material objectivity”). If you think that this is advisable, please insert 
“of material properties” after “principle of objectivity” in the table of contents (sec- 
tion 1 1 ), in line 2 of page 3, in the title of section 1 1 on page 20, and in line 15 from 
the bottom of page 21.1 think that no additional changes are necessary because the 
other references to the “principle of objectivity” cannot lead to confusion. 
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C.T. to W.N., August 9, 1958 (handwritten): 

Dear Noll, 

I have to confess that my dislike of “objective” is subjective. Psychologists, 
sociologists, and all sorts of other charlatans and fakers use “objective” when the 
only sense I can find for what they claim is “nonsensical” or “thoughtless”. Thus I 
exclude, banish and ostracize “objective” as the opposite of ’’subjective” in my own 
usage, but of course there is no reason why this should influence you. 

Cordial regards, CT 

C.T.’s objection did influence me and I am now glad it did. I suggested that we 
change indifference to frame-indifference. C.T. accepted that and I believe now that 
the term we finally used expresses the idea better than any of the ones used before. 

The entire task of writing the NLFT turned out more extensive than was ex- 
pected, and C.T. called it the “monsterino”. (He called the earlier Classical Field 
Theories the “monster”.) The monsterino was finally published in 1965. There were 
many people who helped us with the work by reading parts of the manuscript and 
offering corrections, critique, and suggestions. They include B.D. Coleman, J. Er- 
icksen, M.E. Gurtin, R. Toupin, K. Zoller, C.-C. Wang, D.C. Leigh, C.C. Hsiao, 
W. Jauzemis, and A.J.A. Morgan. On June 25, 1961 C.T. wrote to me: 

I now have an excellent secretary, so that the preparation of the manuscript should be 
easier. 

That helped, too. 

Clifford Truesdell was very meticulous about attributions. I believe he bent over 
backwards in my favor when he described, in a footnote to the Introduction to 
NLFT, how the work was divided between us: 

Acknowledgment. This treatise, while it covers the entire domain indicated by its 
title, emphasizes the reorganization of classical mechanics by Noll and his associates. 
He laid down the outline followed here and wrote the first drafts of most sections in 
Chapters B, C, and E and of a few in Chapter D. Among the places where he has 
given new results not published elsewhere, shorter proofs, or major simplifications 
of older ideas may be mentioned (21 items). The larger part of the text was written 
by Truesdell, who also took the major share in searching the literature. While Noll 
revised many of the sections drafted by Truesdell, it is the latter who prepared the 
final text and must take responsibility for such oversights, crudities, and errors as 
may remain. 

The Nonlinear Field Theories was reprinted in 1992. Here is an excerpt from 
the preface to the second edition: 

This volume is a second, corrected edition of The Nonlinear Field Theories of Me- 
chanics, which first appeared as Volume 111/3 of the Encyclopaedia of Physics, 1965. 
Its principal aims were to replace the conceptual, terminological, and notational chaos 
that existed in the literature of the held by at least a modicum of order and coherence, 
and second, to describe, or at least to summarize, everything that was both known 
and worth knowing in the held at the time. Inspecting the literature that has appeared 
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since then, we conclude that the first aim was achieved to some degree. Many of the 
concepts, terms, and notations we introduced have become more or less standard, 
and thus communication among researchers in the field has been eased. On the other 
hand, some ill-chosen terms are still current. Examples are the use of “configuration” 
and “deformation” for what we should have called, and now call, “placement” and 
“transplacement”, respectively. (To classify translations and rotations as deformations 
clashes too severely with the dictionary meaning of the latter.) We believe that the 
second aim was largely achieved also. We have found little published before 1965 
that should have been included in the treatise but was not. However, a large amount 
of relevant literature has appeared since 1965, some of it important. As a result, were 
the treatise to be written today, it should be very different. On p. 12 of the Introduction 
we stated “. . . we have subordinated detail to importance and, above all, clarity and 
finality”. We believe now that finality is much more elusive than it seemed at the 
time. The General theory of material behavior presented in Chapter C, although still 
useful, can no longer be regarded as the final word. The Principle of Determinism 
for the Stress stated on p. 56 has only limited scope. It should be replaced by a more 
inclusive principle, using the concept of state rather than a history of infinite duration, 
as a basic ingredient. In fact, forcing the theory of materials of the rate type into the 
general framework of the treatise as is done on p. 95 must now be regarded as artificial 
at best, and unworkable in general. This difficulty was alluded to in footnote 1 on 
p. 98 and in the discussion of B. Bernstein’s concept of a material on p. 405. This 
major conceptual issue was first resolved in 1972 [“A New Mathematical Theory of 
Simple Materials” by W. Noll , Archive for Rational Mechanics and Analysis, Vol. 48, 
pp. 1-50], and then only for simple materials. The new concept of material makes it 
possible, also, to include theories of plasticity in the general framework, and one can 
now do much more than “refer the reader to the standard treatises”, as we suggested 
on p. 11 of the Introduction. 

About three years ago. Springer- Verlag sold the right to translate NLFT into 
Chinese. I recently received copies of the Translation. 1 



1 After this paper was written, I was informed by Springer- Verlag that a second reprinting will 
appear in 2003. 
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The following comments are a slightly emended version of a talk delivered at a 
meeting of the Society for Natural Philosophy in December 1 985, together with 
additional remarks appended in May 2003. 

This meeting of the Society for Natural Philosophy has been arranged as a spe- 
cial tribute to its principal founding member, Clifford Truesdell, in recognition of 
his many contributions to the Society since its beginning at The Johns Hopkins 
University in March 1963. In this group picture of the participants of the first 
meeting (see Figure 1) you will see Clifford Truesdell at next to the left in the 
front row, and no doubt you will also recognize a number of other faces, all 22 
years younger and perhaps 22 years wiser, than today. 

In the intervening years the Society has sponsored 27 additional meetings, hold- 
ing fast throughout its existence to the original form, to small size, no publications, 
and the encouragement of quality of research - primarily in rational continuum 
mechanics and its foundations, though with occasional forays into related mathe- 
matical disciplines. And outstanding menus, as well. 

Consistent maintainance of the original principles of the Society is an achieve- 
ment due more to Clifford Truesdell than to any other single individual. It is impos- 
sible in a few short minutes to do justice to his remarkable and myriad accomplish- 
ments, in papers, monographs, memoirs, and books ranging from mathematical 
sciences, to rational mechanics, natural philosophy, and the history of science. 
Already in the forties there was clear evidence of this future in several unusual 
publications, perhaps not today known to everyone here. 

As a student at Indiana University I was privileged early to see the power of his 
thought in exceptional lectures which he offered in elasticity and kinetic theory - 
perfect in content and stately in development - thus, the appearance in 1952 of the 
seminal paper “Mechanical Foundations of Elasticity and Fluid Mechanics” came 
to me not so much as a surprise but rather as a remarkable opportunity to learn 
more from a great master. Several years before, he had initiated research on the 
kinematics of vorticity, together with the interrelations of vorticity to thermody- 
namics. This culminated in his next work, the elegant monograph “The Kinematics 
of Vorticity”, published by the Indiana University Press but now out of print, 
and, following without pause, his great paper on the absoiption and dispersion 
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Figure 1. First Meeting of the Society for Natural Philosophy, March 25-26, 1963. The front row left-to-right: L.M. Milne-Thomson, C. Truesdell, 
B. Coleman, J. Serrin, J.L. Ericksen, H. Markovitz, E. Sternberg, W. Noll, H. Grad. R. Toupin, R. Rivlin. Back row second from the right: Charlotte 
Truesdell. 
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of sound according to the viscosity formulae of Navier and Stokes. Altogether 
one of the most audacious first appearances of a new star in the firmament of 
mechanics ! 

While time makes it impossible to dwell upon still another quiver in his bow, 
I must add that it was exactly following these works that he began the three cel- 
ebrated prefaces for the Opera Omnia of Leonhard Euler - one each on fluid 
mechanics, acoustics, and elasticity. These established his fame as the leading 
modern interpreter of Eighteenth Century Mechanics, and of course brought to 
our attention, as nothing else, the contributions of Euler to our science. 

In 1960 Truesdell and Richard Toupin discovered the important entropy rela- 
tions now known as the Clausius-Duhem inequality, specifically including in it 
the crucial external heat supply term. This development made possible the later 
theory of Coleman and Noll, providing thereby, for the first time, a rational and 
rigorous approach to the foundational aspect of constitutive restrictions. Moreover, 
as appeared in retrospect many years following, the radiation supply term provided 
a vital link between classical thermodynamics and modern theory. 

Truesdell’s work on the Clausius-Duhem inequality was severly criticized by 
those whose understanding of thermodynamics had been formed without benefit 
from study of the masters of the subject. The criticisms could not be fully an- 
swered, ironically, because the foundations of thermodynamics were themselves 
open to question. In any case, this neglected and nettle-choked thicket of physics 
became one of the principal areas of Truesdell’s later work: of course his detractors 
never dreamed that anyone would set foot in those Augean stables, nor of course 
did they have the courage to do so themselves. Nevertheless, the difficulties of 
thermodynamics were enough to require many further years of thought before the 
ideas were ripe for presentation, and other projects loomed immediately. 

Two monumental volumes for the Handbuch der Physik appeared in the early 
sixties, “The Classical Field Theories” with Richard Toupin, and “Non-linear Field 
Theories of Mechanics” with Walter Noll. It is almost impossible to know how to 
approach these volumes. Their weight of scholarship and perception, their very 
completeness, daunt the mere mortal who opens them. Even so, the rich veins of 
ore were, individually and internationally, mined by the many. Here one may also 
recall the impress which Truesdell’s views had on the seven companion volumes of 
the Handbuch der Physik; for the most paid he selected not only the general subject 
matter, but also the corresponding authors. Even now, thirty years later, many of 
these volumes arc as fresh as ever - required reading for the acolyte. 

After 1964 thermodynamics and kinetic theory began to occupy Clifford more 
heavily than ever before, as if only the most monumental problems were now suf- 
ficient of challenge. In 1957 he had published a first paper on chemical mixtures, 
followed by a second paper in 1968 which inspired much further work on this most 
difficult of subjects. Then came in 1969 the polemical masterpiece “Rational Ther- 
modynamics”, summarizing his thoughts to that time and forming a springboard 
(I almost wrote “launching pad” but rejected this) for future work. 
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Figure 2. Journal of Rational Mechanics and Analysis: Volume 1, Number 1. 



If I may digress a moment, let me recall that this scientific activity was paral- 
leled by continuing effort to re-establish both Rational Mechanics and the methods 
of Natural Philosophy as integral parts of physical science. Beyond the formation of 
the Society for Natural Philosophy, there is the founding of the Journal of Rational 
Mechanics and Analysis in 1952 (see Figure 2), its reappearance as the Archive 
for Rational Mechanics and Analysis in 1957, and the inauguration of the Archive 
for History of Exact Sciences in 1960, to name his major and most longstanding 
contributions in this direction. Galileo had said “Nature first made things in her 
own way, and then made human reason skillful enough to understand - but only 
by hai'd work - some parts of her secrets.” Clifford saw well that a combination 
of mathematical analysis and historical sense was the key to this understanding, 
and he emphasized always, as Dedekind put it, “What is provable should not be 
believed in science without proof.” In these endeavors his practice emulated his 
preachment to the Society at an earlier meeting. “The aim of rational mechanics is 
to provide a sound conceptual framework for description of nature as human senses 
perceive it and to create patterns of systematic inquiry and inference such as to or- 
der and interrelate the phenomena thereby conceived.” Deeper understanding was 
to come through the successive works of generations, together with our experience 
of materials themselves. 
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I would like to bring you the message that Clifford’s evangelism succeeded 
beyond his expectations, but must sadly report that the world is still peopled by 
pagans (and worse) - a theme which Clifford of course has voiced frequently in 
his scientific essays. In any case I herewith dedicate for his perusal two sentences 
culled at random from today’s scientific journalese - the first “If space has nine 
dimensions and matter is strings then the mysteries of the universe may soon come 
clear” (so help me God), and the second, “Among enhancement techniques the 
National Research Council will investigate arc sleep learning, group cohesion tech- 
nologies, accelerated intelligencing and parapsychological stress.” But one must 
put aside the ridiculous in the study of nature. 

Returning then to the question of classical thermodynamics, one observes from 
the outset that this is the sole discipline of classical physics where serious argu- 
ment about fundamentals remains. Even while moguls of science grant entropy 
and energy all-pervasive influence, it is impossible to obtain agreement from these 
grandees on the two laws of nature from which these concepts derive. As recently 
as 1970 the subject was innocent of mathematical structure, and the lack of preci- 
sion showed throughout. The introductory essay in “Rational Thermodynamics”, 
written in 1968, tells this story faultlessly. 

Truesdell attacked this miasmic swamp with vital energy, and brought others as 
well to the fray. The outcome is now clear; there is no longer doubt that energy 
and entropy can be set upon sure and credible foundations,* vindicating Clifford’s 
intuition when he proposed the Clausius-Duhem inequality in 1960. His own paid 
of this research, canned out from 1974-1984, included in particular a rigorous 
axiomatization of reversible thermodynamics. There is also in this work a corollary 
result, a beautiful pearl, which I think deserves to be better known than it is. This is 
his discovery that, for reversible systems, the law of thermodynamics can be stated 
without recourse to the mechanical equivalent of heat, indeed in such a way that the 
existence of this mechanical equivalent can be deduced, much as the existence of 
absolute temperature can be inferred from the second law. Generalizations of this 
to arbitrary thermal systems have been given by later writers, leading me to predict 
that ultimately this method of viewing the First Law will become a new paradigm 
of physics. 

Books issue in continuing stream from “II Palazzetto”: “Introduction to Rational 
Elasticity” (with Wang), “The Tragicomical History of Thermodynamics, 1822- 
1854”, “A First Course in Rational Mechanics”, “An Idiot’s Fugitive Essays on 
Science”, “Fundamentals of Maxwell’s Kinetic Theory of a Simple Monatomic 
Gas” (with Muncaster), the last his magnus opus on a subject which has been with 
him for 40 years, and even papers on antique furniture and the routes to Hell. What 



* Note added, March 2003. This statement needs some clarification: The context should be re- 
stricted to phenomenological thermodynamics, in which heat and work are the primitive elements, 
and where also there are given differential forms defining the heat and work associated to processes in 
question. Beyond this, there are systems where one may credibly define energy and entropy, though 
such definitions may restrict the set of available processes of the systems. 
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more proof is needed for the converse of the famous epigram of Leonardo da Vinci: 
“Quiet water becomes stagnant. Iron rusts from disuse. So doth inactivity sap the 
vigor of the mind.” 

Clifford Truesdell inspired a generation to study Natural Philosophy, and per- 
suaded the learned that Athena is not dead. He has repeatedly and deservedly been 
honored for his many accomplishments. There is the Bingham Medal of the Soci- 
ety of Rheology which he received in 1963, the Modesto Panetti Gold Medal and 
Prize from the Turin Academy of Sciences in 1967, and the George David Birkhoff 
Award for Applied Mathematics from the American Mathematical Society in 1978, 
as well as Foreign Membership in the Accademia Nazionale dei Lincei. I dare say 
that the Society for Natural Philosophy as well would wish to present formally 
to him its highest constitutional award: the great regard in which he is held, the 
high esteem he is owed, and our continuing best wishes for the furtherance of the 
scientific program in rational mechanics which he initiated at the beginning of his 
career. 

The following additional words were appended in May, 2003, drawn from a 
lecture given in November, 2000, at Pisa at the Meeting in memory of Clifford 
Truesdell. 

Generous in praise, criticism, judgement and friendship, none bestowed lightly: 
ordinary words do not suffice for Clifford Truesdell. He had great vision accom- 
panied by exceptional accomplishments, at the same time supremely gifted, prodi- 
giously learned and with almost overwhelming energy. He lived life not just to 
the full, but to overflowing in every activity he undertook: his prodigal lifestyle, 
his many volumes of correspondence, his historical research and essays, his many 
books, both scientific and philosophical, and his exceptional editorial activities. 

He was lavish in his help for young researchers and was happy with the success 
of those who followed in the directions he pioneered. He chose his friends well, 
and from the group around him and around the Archive some of the greatest work 
of mechanics and thermomechanics in the last century resulted. 

Theodore Roosevelt’s old fashioned words apply still to Clifford and to his 
goals, “Far - better it is to dare mighty things, to win glorious triumphs, even though 
checkered by failure, than to take rank with these poor spirits who neither enjoy 
nor suffer much, because they live in the gray twilight that knows neither victory 
nor defeat.” 

All the same, it is necessary to add that Clifford was not without the faults of 
the gods - perhaps he was aw are of this - even allowing this aspect of his character 
to be a full paid of his personality. Clifford could become upset on occasion, not 
always fairly; there were exasperating times when he became all too human. 

Let me mention one moment engraved on my memory. On one occasion when 
my wife Barbara and I were visiting Clifford and Charlotte, after the dinner, while 
he and I were standing at the top of the magnificent divided stairway at the 
Palazzetto, the subject of absolute temperature in classical thermodynamics came 
up. After all at that time we were both violently infected by the thermodynamic 
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plague. He would not accept my view of the “hotness manifold”, which perhaps 
unduly boldly I had ventured to discuss. While always expressed with politeness, 
the differing views clearly upset him - he was finally led to utter the ultimate put 
down: “The problem with you, deal - James, is that you never understood Bernard 
Coleman.” Of course, that is no doubt true 

Let me add something about the Archive (Archive for Rational Mechanics and 
Analysis). I will ramble a bit, because the main aspect of the Archive - what it is 
and what it has meant is familial - . 

In the years between 1946 to 1960 there was a wonderful period of optimistic 
activity in the United States, based on the successful ending of the Great War 
against Germany and Japan, and also on the need to supply many of the require- 
ments of life which had not been met since the beginning of the Great Depression 
in 1930. 

Thus a period of optimism was just beginning and American mathematics was 
coming to maturity. Before that time, American higher mathematics had been con- 
centrated almost entirely in the universities of the East Coast - Harvard, MIT, 
Yale, Columbia and Princeton; and otherwise only at the University of Chicago 
and at Berkeley. There had been only 500 members of the American Mathematical 
Society, when now there are 10,000. A year’s collection of mathematical review 
was less than half what we now receive in one month. This small world was to 
change dramatically after the war, with the coming of European mathematicians, 
mostly German, who had escaped from Hitler’s tyranny or from the devastation of 
Europe. They were welcomed in America, to aid in the war as scientists, or as part 
of the generosity towards the world which Americans in those days so strongly 
showed. 

It was my good fortune to go to Indiana University at just this time, to study 
with a newly formed but impressive faculty, including among others Max Zorn, 
Eberhard Hopf and David Gilbarg. Clifford Truesdell arrived in Bloomington in 
1950, one year before I left, brought there to solidify the study of continuum me- 
chanics. The students were completely in awe of him, and indeed his lectures were 
amazing tours de force, which those who knew him later can easily imagine. That 
was the setting. 

Now let me turn specifically to the Archive, at first in Clifford’s words. 

“A mathematical journal of a different kind had to arise. It was only a question 
of persons, place, and time. In those circumstances, T.Y. Thomas, the Head of the 
Department, asked me to join him in founding a journal to serve the then growing 
fields of mathematical continuum mechanics and the analysis of nonlinear partial 
differential equations.” 

“Beyond the scope and editorial policy, there was some discussion about the 
unusual title ‘Rational Mechanics’, but in the end it was adopted because Newton 
had introduced it in his Principia and had not only exemplified it but defined it.” 

Even the form of the cover of the new journal occupied his mind - for what it is 
worth, my small contribution in this direction was to choose the color, red instead 
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of blue, to be used as the contrasting ink on the cover. Thus the JOURNAL of 
Rational Mechanics and Analysis was bom, complete with Latin inscription. 

Number 1 of Volume 1 of the Journal of Rational Mechanics and Analysis 
appeared in January, 1952. 

Clifford left Indiana in 1957 for Johns Hopkins, and the old Journal became 
the new Journal of Mathematics and Mechanics (since that time renamed again as 
the Indiana University Mathematics Journal). At the same time, a new periodical 
emerged from the ashes, the ARCHIVE for Rational Mechanics and Analysis. 
This time by the way, there was no problem in choosing a color for the cover, 
the publisher Springer- Verlag being inextricably committed to yellow, and yellow 
only. Both journals have of course flourished and arc still here, a tribute to the 
far-seeing eye and organizational ability of Truesdell. 

The early years of the Archive saw a great revival of mechanics as a rational doc- 
trine, with the contributions of Stuart Antman, Bernard Coleman, Jerald Ericksen, 
Roger Fosdick, Morton Gurtin, Daniel Joseph, Victor Mizel, Walter Noll, David 
Owen, Richard Toupin, as well as the celebrated analysts, Antonio Ambrosetti, 
Haim Brezis, Constantine Dafermos, RL. Lions, J.B. McLeod, Paul Rabinowitz, 
and others. The Archive had become necessary for every fine scientific library. 

It was thus fitting and natural that when Clifford Truesdell retired from the 
editorship in 1989, Stuart Antman was chosen to follow in the same tradition. The 
Archive requires special qualities of mind of its main editors - taste in subject and 
style, judgement of scientific merit, dedication to an ideal, sheer plodding effort 
- together with the ability to see a union between mathematical analysis and the 
physical or rational world. These arc talents which Stu had in abundance - and, 
during the time I was co-editor of the Archive, talents which I could draw on from 
Clifford Truesdell when required. 

The focus of the Archive has turned gradually in recent year's, in part due to 
changed research directions in rational mechanics and analysis, a change which af- 
fected both contributors and Members of the Editorial Board as well as the response 
of the chief Editors. 

Clifford viewed mathematics and mechanics as a type of ait, as part of our living 
culture. Here are the words of a famous Finnish composer - which seem equally 
relevant to the life of mathematics: 

“It is my belief that art is great if, at some moment, it catches ‘a glimpse of 
eternity through the window of time’ - if the experience is one which we might 
call ‘the oceanic feeling’. This to my mind, is the only true justification for all art. 
All else is of secondary importance.” (Einojuhani Rautavaara) 
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1. Introduction 

Allow me to recall that the young scientist, formed at Caltech and who had won 
acclaim through a series of articles on special subjects, soon acquired extraordinary 
fame through his comprehensive Handbuch articles, the second of which was writ- 
ten with Walter Noll, where they reformulated the foundations of the mechanics of 
continua. The mechanics of continua, which before had appeared in all textbooks 
as a conglomerate, even as a sandhill of sometimes isolated subjects between which 
there was no connection, logical or mathematical, became now connected and 
unified by a powerful system of axioms in the way Hilbert had postulated at the 
beginning of the century. 

This unification of the entire field of mechanics became possible, among other 
things, through the sharp distinction between dynamical principles and constitu- 
tive equations. The latter formulates the special properties of the materials only - 
something that cannot be deduced in classical mechanics, but must be left to quan- 
tum mechanics. In classical mechanics such a property must be formulated by an 
additional hypothesis. Noll and Truesdell then showed that this distinction could 
be traced back to Cauchy and from Cauchy to Euler and to Jacob Bernoulli. And 
thereby we have now stepped onto the domain of the history of science. 

But here we must now pause for a moment. While it is obvious that having 
worked out a systematic organization of mechanics is indeed an incomparable 
preparation for analyzing, ordering and understanding also the historical discov- 
eries and the various processes of the development of science, one cannot stress 
enough, on the other hand, that science and history arc two radically different 
endeavors of the human spirit. 

The essence of science lies in its property of being systematic since science 
ultimately always wishes to grasp the laws of nature, which it strives to uncover and 
to formulate in the simplest and most transparent form. But human history, and thus 
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also the history of science, is the complete opposite of this: it is totally unsystem- 
atic, always complex and never simple nor transparent. So, for writing the history 
of science two different, indeed totally opposite, endeavors must simultaneously be 
at work in the same man. Thus from the same man two almost irreconcilable gifts 
arc requested - gifts from his intellect as well as from his heart. This confrontation, 
one might say “clash”, of the endeavor to systematize and to extract the universally 
valid from the documents which the historian finds before him, with the aim to 
determine the conditions under which this, always unique, discovery was made, 
under very special circumstances and by one distinct individual different from all 
others, and then to interpret its significance for the development of science, is the 
character of the history of science. It is its very essence, even its unique prerogative 
and also its characteristic charm. 

Beyond and above all of the interesting facts which we learn and insights which 
we gain about the progress that we arc allowed to observe and learn to appreciate, 
it is precisely this constant confrontation which fascinates the reader of TruesdelTs 
articles and books, and there perhaps more so than for any other historian of science 
I know. No wonder then that the interest of such a scientist-historian is directed 
especially towards the great systematizers and unifiers, those who at the end of 
previous long developments could lay the definite foundations of a whole field, 
like Newton, Euler, Cauchy, and Maxwell, or prepare new ways like Hilbert did for 
quantum mechanics. But as we shall also see, Truesdell had a special admiration 
also for Jacob Bernoulli. Here I shall restrict myself almost exclusively to what 
Truesdell did for the Euler Edition, which was the cause of my first contacts with 
him, and later, which is a very different story, for the Bernoulli Edition. 



2, Truesdell’s Introductions to Euler’s Works on Hydrodynamics 

Truesdell was invited in the fifties by the mathematician Andreas Speiser, then 
General Editor of Euler’s Opera Omnia, to edit volumes 12 and 13, series II, which 
contain Euler’s writings on hydrodynamics. The great systematizer Truesdell was, 
I must remind no one, cut out for that job much more than Speiser could have 
known. Indeed, Truesdell’s introductions to the two volumes 11.12 and 13 offer 
much more than what their titles promise and what Speiser possibly could have 
hoped for. He traced the subject back to its origins, Archimedes for the hydrostatic 
and Torricelli for the hydrodynamical paid, and then presented the development 
of these branches of science in detail, from the second half of the 17th century 
throughout the whole 1 8th century and beyond. Truesdell presented this scientific 
development, where many strands lived for a long time their separate lives. Later 
they became more and more intertwined and now form whole domains, but in the 
unplanned and rationally never fully explicable way that history, which does not 
care much for its future historians, always proceeds. And eventually he showed 
how these separate domains came together and why they had to be formulated by a 
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new mathematical language - a language the very creation of which was stimulated 
largely by this development. 

In these two introductions we can see how the systematic understanding of a 
science from today’s point of view not only helps to unravel the various strands, 
but is indeed also indispensable for showing what exactly each strand and each 
domain has contributed, and thus for doing justice to each of them. Here it is not so 
much the experimental versus the theoretical progress that must be balanced; for, 
as Truesdell observed, because of the sudden enormous progress of mathematics 
during this period, theory was then mostly far ahead. Incidentally, this is perhaps 
the secret reason why this period was always, and still is today, so much neglected 
by the historians of physics. Rather, the difficulty is to attribute proper credit not 
only to the stimuli that comes from new, now suddenly accessible, problems of 
mechanics on the one hand but also on the other to the impulses due to the analyt- 
ical and, to a minor degree, geometric discoveries made around the middle of the 
century. 

First we have Newton’s heritage, deposited in the second book of the Principia. 
It is well known that precisely here arc some of the book’s most difficult, even 
obscure passages. Truesdell ’s summary of it and his evaluations arc extremely suc- 
cinct; they fill merely two pages. Even so they arc a very valuable help to the reader. 
This holds true especially for his careful separation of what Newton had derived 
by stalling with the corpuscular view from those results that he had derived using 
a continuum view and, finally, from those results for which he used both kinds of 
assumptions. The only regret the reader feels is that this whole section is so short. 

And then we find Truesdell's examination of Daniel Bernoulli's Hydrodynam- 
ica. This book contains, albeit in splendid isolation from the rest, the celebrated 
excursion into the kinetic theory of gases, as we say today, which Truesdell places 
in the proper historical perspective. Here, Bernoulli was over a century ahead of his 
time. But then in this book hydrostatics is unified with hydraulics. This was the first 
of the two big unifications of theories during this period. It was achieved through 
the celebrated Bernoulli equation, which, as Truesdell carefully explains, was by 
no means written by Bernoulli in the form in which we know it today. Indeed, not 
less than four major discoveries were needed before its transparent form, due to 
Euler, became possible. 

The first of the four discoveries just mentioned was Johann I Bernoulli’s intro- 
duction of Newton’s concept of force into hydrodynamics, if only in a one-dimen- 
sional setting. It was Truesdell who rediscovered this fact, acknowledged already 
by Euler, thus dealing Johann from the reproach of having plagiarized his own 
son. Later Szabo confirmed Truesdell’s vindication in even stronger terms. And 
Truesdell underlined that the roots of Euler’s hydrodynamics were to be found in 
Johann’s rather than in Daniel’s work. 

The second important discovery was Euler’s deeply penetrating understanding 
of what we call today an “inertial system” and, in particular, the fact that only in 
such a frame can one expect the laws of nature to have a simple form, a form that 




42 



D. SPEISER 



one can guess through mathematical or physical imagination. Truesdell calls this 
paper E177 “a research, that will change the whole face of mechanics”. This is 
the moment to mention the recent book by the Italian mathematician and historian 
Giulio Maltese in Rome: La storia di “F = ma”. This book deals with the devel- 
opment of particle mechanics from Newton to Euler and it is perhaps the first one 
to profit fully from Truesdell’s work. 

The third important discovery was Euler’s creation of the concept of “inner 
pressure”, as opposed to Stevin’s “outer pressure”. This new concept was the key 
to his equations. 

The fourth important step lies on the mathematical side of this development. 
It was the formulation of a field description due to d’Alembert, a discovery that 
again was first pointed out by Truesdell. And on the mathematical side, one can, 
of course, directly observe the greatest step due to the mathematics of the 18th 
century: the systematic development of partial differential equations, due again 
largely to Euler and to d'Alembert in their work on hydrodynamics. The interplay 
between these two lines of research were masterfully pursued and presented by 
Truesdell. If we think how loath scientists usually arc to recognize how much their 
own field owes to the progress of other areas, Truesdell's presentation is doubly 
remarkable and welcome. 

Thus the ingredients were together for the equations of hydrodynamics, pub- 
lished in 1752 by Euler in his paper E 258. Thereby, hydrodynamics and aerody- 
namics were unified into one dynamical theory, which can be applied equally to 
either of them by selecting a special constitutive equation. These equations mark 
more than only a period in the history of science: they represent the first field 
theory in physics! The Bernoulli equation appeal's now in the form we know it 
today, namely as an integral of the equations of motion, valid only under certain 
circumstances. 

To this culmination point in the history of mechanics there corresponds a culmi- 
nation in Truesdell’s narratives: his commentaries on Euler’s two comprehensive 
presentations of hydrodynamics. The first of these consisted of three papers written 
in the 1750’s in French, and the second was written in the 1760’s in Latin. The latter 
is part of Euler’s great project, conceived in 1734, to present the entire science of 
mechanics in 6 volumes. The first of Euler’s presentations reflects the moment of 
the discovery when it was still fresh in his mind, and the second, a more polished 
one, shows the ambition to present everything as clearly as possible, for beginners. 

I mention here only Truesdell’s comments on Euler’s paper E 331: “On the 
motion of fluids arising from different degrees of heat”, written in 1764. This paper 
belongs to the prehistory of meteorology. Euler pursued a line taken up 150 years 
later by Milankovitch, but Truesdell pointed out, and again he was here the first to 
do this, what this paper brought to thermodynamics. 

But he restricted his account by no means to the leading figures of history, 
even if they do receive the lion’s share, but others are not overlooked. Thus, the 
work of Simon Stevin, one of those who is always underestimated, received elo- 
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quently his due. Truesdell drew the reader’s attention also to Jacob Hermann, Jacob 
Bernoulli’s most important disciple next to his brother Johann. It will be an impor- 
tant task of the Bernoulli Edition, to scrutinize Hermann's works for important 
contributions. 

In the introduction to the second volume one finds Truesdell’s account of the de- 
velopment of acoustics. Besides Euler’s, two names stand out here: Daniel Bernoulli 
and Lagrange. I prefer to mention Bernoulli’s work later in connection with the in- 
troduction to the history of elasticity. Of Lagrange’s work, Truesdell gave a detailed 
account of correspondence with Euler as well as of Lagrange’s own published 
papers: he wrote that “the velocity-potential theorem and the impulse theorem arc 
first rate creative works and Lagrange’s greatest discoveries in fluid dynamics.” 

Lor organizing this vast amount of material, as one can appreciate, an intimate 
acquaintance with the sources is not enough. What was essential here was a deep 
insight into the science of mechanics itself - an insight that could be gained only 
from its most modern formulation. Such an understanding was essential to provide 
a systematic and powerful enough reference system for organizing adequately this 
enormous body of material. It is the combination of both of these qualities, which 
Truesdell possessed to the highest degree, which lends to his presentation relief 
and gives to the narrative at the supreme moments a dramatical quality. 

But let us return to Euler’s equations and their two great presentations. Lrom 
Euler these equations came to Cauchy, who, with the help of his new creation - 
the stress tensor - laid the foundation of the theory of elasticity in its definite form. 
During the same period the field idea, through Euler’s “Lettres a une princesse”, 
reached Faraday, who quoted them frequently in his journal, and from Cauchy and 
Faraday the concept of a field came to Maxwell. One may fairly say, that either 
of Euler’s great works, especially the Latin one, if supplemented with some of 
Truesdell’s comments which place them in a modern perspective, arc, or alas, as 
I rather must say, would still today be the best introduction to hydrodynamics at 
the level of high school, and even more so, at the level of university teaching. The 
presentation of the idea of what a law of nature is, how it works and what use is 
made of it under special circumstances including technical applications has hardly 
ever been surpassed. This presentation is never clouded by mere formalisms nor by 
going into the details of technological applications which, more often than not, just 
obscures the basic idea. By Euler and by Truesdell one is led from one essential 
point to the next with only the absolutely necessary excursions until one reaches 
the summit. 



3. Truesdell’s Introduction to Euler’s Work on Elasticity 

“The rational mechanics of flexible or elastic bodies 1638-1788”, as the title im- 
plies, reviews its subject from Galileo’s Discorsi to Lagrange’s Mechanique Anal- 
itique. But in fact it opens with a survey over the prehistory of the whole field of 
early Greek antiquity and also deals with the Middle Ages. Here we find such 
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Truesdellian pearls like the following first sentence: “Duhem's great historical 
studies showed that the apparent darkness of mediaeval physics is but darkness 
of our knowledge of it.” Indeed this was a call to fill an immense gap in science. 
I will mention here only one glaring hole: our ignorance of mediaeval technol- 
ogy. How, exactly, did the mediaeval architects and engineers proceed with the 
construction of their enormous cathedrals, especially of the towers, which surpass 
in height, refinement and daring everything that the Greeks and even the Romans 
had achieved? Truesdell’s clarion call ought to remind all historians of science as 
well as of the aits that they should direct their attention much more to the Middle 
Ages than they have done so far. He mentions explicitly the elusive Jordanus de 
Nemore and notes: “The only writing of value on deformable bodies that I have 
been able to see is the fourth book of Jordan de Nemore’s Theory of Weight 
(13th century), and remarkable it is, Western in spirit, ambitious beyond anything 
in the Greek or Arab tradition. The seventeen propositions on fluid flow, resistance, 
fracture and elasticity arc all original.” And the next section contains an evaluation 
of Leonardo’s achievements. 

The whole account tills the separate volume II, lib of 400 pages that contains 
the introduction to the two volumes II, 10 and 11! An immense achievement: 
theories, experiments, simple historical facts, etc. arc not only enumerated but 
thoroughly “digested”; that is, their scientific content is explained and imbedded 
into the historical development. To satisfy both requirements makes, of course, 
heavy demands on the writer. Add to this that the account is based not only on 
the published writings, but also on an enormous number of letters, for instance, on 
the quite complex epistolary exchanges between the Bernoullis and Euler, which 
Truesdell searched through. From these we can get an idea of the Herculean labor 
that he has gone through. And besides theories, we also find experiments dis- 
cussed - experiments done, among others, by Musschenbroek, Giordano Riccati 
and Chladni and the account reaches far into technology and engineering. 

Since the field of elasticity has a more complex structure than hydrodynamics, 
its history presents more isolated strands and subdomains. Thus, a lucid arrange- 
ment is here much more difficult to attain. Here I must restrict myself to a small 
paid of Truesdell’s accounts of elasticity proper and the notion of flexibility. I shall 
begin with the latter. 

While the modern theories of flexibility began with Galileo’s and Mersenne’s 
investigations of the string, it was only the investigation of static problems like 
the hanging cord and the suspension bridge that led science to ask for mechanical 
explanations. In 1712, Brook Taylor computed the fundamental frequency of the 
string. While Daniel Bernoulli, as we learn from Truesdell, had a bit later the theory 
of the overtones in his hand, he did not publish it. Instead he studied the double-, the 
triple-, the multiple-pendulum, and asked the following question, characteristic of 
his whole research: under what condition is the oscillation stationary? His answer 
was: a pendulum with n masses has exactly n stationary oscillations, and he com- 
puted up to n — 5 its n frequencies as a function of the various masses and length’s. 
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Turning next to the the hanging cord, by going to the limit he proved that it has 
infinitely many stationary oscillations given by the zeros of a new transcendental 
function, today denoted as the Bessel function / 0 (x). 

A little later Johann I Bernoulli formulated the problem of the motion of several 
particles attached to a string. Truesdell saw that in this field too he was the first to 
use Newton’s equation of motion. 

Then d'Alembert, who had studied Daniel Bernoulli's two papers, published 
in the Traite de Dynamique in 1743 a parti al differential equation for the hanging 
cord, the first partial differential equation in mechanics. Truesdell called this dis- 
covery “a turning point in the whole history of mechanics”. Then, in 1746, in his 
paper “Recherches sur la courbe que forme une corde tendue mise en vibration”, 
d'Alembert presented the differential equation of the string, this time together with 
the solution, and a short time later Euler published the solution found in a different 
way. 

With these discoveries began the last of the big scientific polemics of the 1 8th 
century. Truesdell was highly critical of the polemic itself, which he called “de- 
plorable”. He commented that it “confirms the principle that ever the greatest 
quantity of paper is smeared over with the dullest matter.” But this “great quan- 
tity” was searched through by him with great care, and the questions at issue were 
explained by him incisively. 

In this triangular struggle both d’Alembert and Euler maintained that only the 
partial differential equation would yield all solutions. Their quarrel concerned the 
class of functions that are admissible as solutions. While d’Alembert tenaciously, 
but erroneously, maintained that only the functions today called analytic can serve 
as solutions of a mechanical problem, Euler admitted a much larger class, the 
class of piecewise smooth functions. Hereby he achieved in Truesdell’s words 
“the greatest advance of scientific methodology in the whole century”, because 
it contradicted the Leibniz postulate that in mechanics functions must be analytic, 
a postulate which, according to Truesdell, had not been contradicted by anyone, not 
even by Newton. Bernoulli, however, claimed that the trigonometric series would 
equally well yield all solutions. As he had overlooked the arbitrary phases he was 
wrong, as we know today. The approach of his adversaries carried the day, but 
250 years later we can see that Bernoulli had in fact solved the first finite and 
infinite eigenvalue problems, which occupy now, after the past 75 years, the center 
of quantum mechanics. Thus arc the meanderings of the developments of science! 

All these problems, however, were merely 1 -dimensional ones. Only one 
2-dimensional problem from the field of flexibility was solved during this period: 
Euler discovered in 1759 the equation of the drum. 

The elaboration of the theory of elastic bodies moved during the same period on 
very different lines, especially with respect to the mathematics involved. Truesdell 
traced its modern development back to Beeckman and to Galileo. In the Discorsi, 
Galileo had asked: what is the proportion between the minimal weight Pj needed 
to break a beam simply by elongation and the minimal weight P, needed to break 
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it transversally when one of its ends is clamped into a wall? Galileo’s prediction 
was 



P, ■ P, — (b '■ a)N , 

where a and b are the length and breadth of the beam, respectively, and N is a 
numerical factor. By assuming a special model he then could compute N = 1/2. 
Mariottc, who tried to confirm Galileo’s predictions experimentally found N = 1/4 
rather than 1/2, and he wrote his results to Leibniz. Leibniz suggested in 1684 that 
one should take into account also the energy needed to bend the beam before it 
breaks. Using Hooke’s law he found N = 1/3. According to Truesdell this was the 
first computation which took account of the dilation of fibres. 

A few years later in 1687 Jacob Bernoulli asked Leibniz, in a letter, for ex- 
planations concerning his new calculus and in the same letter he included the 
results of his own experiments which in certain cases clearly disproved Hooke’s 
law. Leibniz, because of his absence on a trip, replied three years later. He sug- 
gested that Bernoulli should determine the exact form of a beam that is bent by a 
weight. Bernoulli at once set out to work on this problem. And indeed by using the 
principle of the balance of moments of forces and his own “golden theorem” - 
his formula for the radius of curvature - he found the solution in the form of 
an integral which contained an arbitrary function depending upon the constitutive 
relation between the stress and the strain. Here, in 1692, Bernoulli recognized the 
distinction between dynamical principle and constitutive equation, for his experi- 
ments had convinced him that there was no universal law valid for all materials. 
In the simplest case, namely when Hooke’s law is assumed, the solution is given 
by the famous elliptic integral, discovered and discussed by him in this paper. Of 
his theory of the bent beam Truesdell wrote (p. 96): “the deepest and most difficult 
problem yet to be solved in mechanics, is his alone.” 

Truesdell pursued Bernoulli’s later investigations on this problem, especially 
on the location of the neutral fibre. It ended, as Antoine Parent showed, with a 
failure. Truesdell remarked: “To the ironies and disappointments which filled [his] 
life must be added that while he originated or assembled all the apparatus sufficient 
to put [his final equation] on firm ground, he failed to do so, failed because his 
attempt was on too grand a scale.” In fact the problem of the neutral fibre seems 
still today not to be solved in full generality. 

Truesdell’s evaluation of Jacob Bernoulli’s achievements is: “In our epoch for 
study, 1638-1788, but one other, Euler, is to build himself a like monument in our 
subject.” On the other hand, it is characteristic for Truesdell that he devoted a full 
section to Parent, whom he rescues from near oblivion by showing that “Parent was 
the first to apply statical principles correctly to the tensions of the fibres of a beam, 
and that he recognized the existence of shearing stress.” These arc no small merits, 
indeed! 

The next step in this development was taken by Daniel Bernoulli. He knew that 
Euler was working on a book on variational calculus, and suggested to him a mini- 
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mum principle for the potential elastic energy stored in a curved beam. Euler imme- 
diately worked out its consequences which he annexed as the first appendix to his 
Methodus Inveniendi Lineas Curvas, where he derived a multitude of new results. 

Truesdell commented also on Euler’s further work in this field, e.g., his discov- 
ery of the shear force and Coulomb’s discovery of the shear stress. In this field 
all correctly solved problems were, again, 1 -dimensional. Even the problem of the 
oscillations of a massive plate was missed by the second Jacob Bernoulli, if also, 
as D.O. Mathuna showed, only by a hair’s breadth! Truesdell was here a bit severe, 
for the definitive solution was given by Lagrange only more than 30 years later. 

Especially impressive is Truesdell’s “modern evaluation”, which fills the last 10 
pages of the book. He divides the task into three parts: the evaluation of Analysis, 
Geometry and Mechanics. Who else could have dared to evaluate three basically 
so different histories? Even a careful and advised reader, I suspect, will discover in 
these few pages, here and there, something of what he believed to have well under- 
stood but had in fact failed to grasp a fundamental aspect. I repeat here the first sen- 
tences of the first summary. The triumphant lines show Truesdell’s rightful pride in 
his own beloved science: Rational Mechanics. He wrote: “Prior to 1730, researches 
on continuum mechanics applied mathematical techniques already developed in 
other subjects, notably in geometry and in the mechanics of point masses. Start- 
ing with the research on vibrating systems by Daniel Bernoulli and Euler, 
the situation was completely inverted. From then on until the end of the century, 
continuum mechanics gave rise to all the major new problems of analysis.” 

On the two last pages of the book Truesdell asked why the foundations to a com- 
plete theory of elasticity escaped this period and writes: “Neither physical intuition 
nor experiment was what was needed here; rather, as both Euler and Chladni 
said, it was want of differential geometry that blocked the way to theories of de- 
formable surfaces and solids”. And after mentioning that Euler had introduced all 
elements of the strain tensor in a paper on hydrodynamics, he notes on the very last 
page: “In surveying all these brilliant individual achievements . . . , we are driven to 
ask why, when Euler had succeeded in 1752 in creating a general theory of perfect 
fluids . . . , nevertheless after many more years he failed to reach a general theory 
of elasticity.” His answer was: “To succeed in hydrodynamics, the only hope lay 
in abandoning a one-dimensional approach. But for elastic or flexible bodies one- 
dimensional theories led to one triumph after another. It was the brilliant successes 
of the special theories that blocked the way to the general theory, for nothing is 
harder to surmount than a corpus of true but too special knowledge.” 

I could give here only an insufficient account of this monumental work of Trues- 
dell: the history of the theory of elasticity is now, probably due to him, the best 
charted and the best investigated domain of the history of physics. And I remain 
convinced that the three introductions to which I referred arc the best guide to 
a deeper understanding and further study of the history of classical mechanics 
and indeed of the history of science; every time I open one of them I find again 
something new and interesting that had escaped me. 
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4 . The Concepts and Logic of Classical Thermodynamics as a Theory of 
Heat Engines. Rigorously Constructed upon the Foundation Laid by 
S. Carnot and F. Reech 



Before turning to my second main topic, Truesdell’s work for the Bernoulli Edition, 
I would like to mention his book with S. Bharat ha. Even if it lies somewhat apart 
from the other works of this account, it brings several characteristics of Truesdell 
to the fore, which seem to me fundamental for his thinking as well as significant. 
I did quote the full title of the book, since this book brings together, like no other 
of his books that I know, science, history and, not surprisingly, conceptual logic. 
And in no other book of those which I know, is Truesdell so preoccupied with 
teaching. Not that I would recommend the book as a textbook for students, but 
I recommend it highly to all those who teach physics. The aim of the book is to 
construct a rigorous foundation of classical thermodynamics based on the idea of 
Carnot cycle. “Rigorous” refers here not only to mathematical rigor, but also, and 
in fact even more so, to conceptual rigor - to a clear and adequate introduction and 
a sharp definition of all concepts that will be used in the equations as well as a 
precise reference as to how they arc to be measured, i.e., how they arc connected 
with experiment. The very first sentence of the preface makes this clear and it is, 
at the same time, a “critique” in the sense of Kant of the possibility of writing 
the history of science. He writes: “I do not think it possible to write the history 
of a science until that science itself shall have been understood, thanks to a clear, 
explicit, and decent logical structure.” I have hardly found in TruesdelTs work pos- 
itive references to philosophy, and probably he would be surprised to be referred to 
as a philosopher; yet what he notes here and in the rest of the section is as important 
a contribution to the philosophy of science and to the philosophy of history of 
science as I have ever heard. Perhaps Truesdell, were he here, would react to this 
compliment with a little smile. On the other hand, the aim which he pursued as a 
historian is expressed by his dedication of the book “as an expression of respectful 
gratitude for the legacy of the great French thermodynamicists Carnot, Reech, 
Duhem”. This dedication incidentally refutes the accusation, which I heard some 
times, that his was an anti French bias. In three sections Calorimetry, Carnot’s 
General Axiom, and Universal Efficiency of Ordinary Carnot Cycles, the results 
arc presented. 

I believe that the progress of science consists in establishing connections be- 
tween various phenomena, between phenomena and their measurements, and it 
includes a process called the formation of theories, i.e., constructing connections 
between different restricted theories by erecting greater and even more comprehen- 
sive theories. The significance of the book then lies perhaps in the first place, that 
no other book of those that I have consulted connects thermodynamics so cogently 
and intimately to classical mechanics. 

But the book has another distinction too. It is a book written especially for the 
teacher, I dare say even for the teacher, who must speak to beginners. In other 
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words, the book has also a pedagogical aim. If it is not directly a textbook, this is 
only because the authors wished to prove that their approach is powerful enough for 
coming to grips with all situations that the practical applications demand. Hence the 
careful analytical generalizations to cases where functions that arc only piecewise 
smooth arc needed, etc. But for the explanation of the thermodynamical principles 
themselves, these technical details arc not necessary and can easily be suppressed 
by the teacher. But what the teacher can learn and teach above all is not so much 
the mathematical rigor, but the conceptual rigor of the theory or of any theory for 
that matter, and the importance of a careful introduction and explanation of all 
concepts. The importance, for instance, of the eternal question that looms over the 
beginning of all introductory courses on mechanics: “what exactly is now the force, 
professor”? 

When I think of my own lecturing, my greatest regret is that I concentrated too 
little and too late on the careful introduction of all concepts used in physics, and 
that I spent in my lectures too little time on their discussion. It is with the help of 
sentences that we prescribe the setting of a reproducible experiment - the concepts 
connect the experiments with the mathematical formalisms. 

Another fundamental point made clear in this book is that all theories arc always 
valid only with respect to a certain domain of the variables and under certain re- 
strictions. It is the neglect of these caveats which makes possible only pseudophilo- 
sophical and pseudoscientific generalizations. Here teachers can learn much that 
will prevent a certain boastful offer of their merchandise and at the same time make 
the understanding of what they present easier since it is focussed. I learned myself, 
for example, that the teacher must immediately at the beginning of each course 
enumerate explicitly all restrictions under which the predictions of the theory only 
arc valid. 

Over 20 years ago I had invited Truesdell to Louvain-la-Neuve for giving a 
series of lectures. In the first one he outlined the content of this book in one hour, 
overestimating, of course, his audience, which was oriented mainly towards quan- 
tum mechanics and its applications. And then he changed to other subjects. But he 
presented me with a copy of the book, and when I had read it, I regretted deeply 
that not the whole series of lectures was directed to this one topic, but I did not 
dare say it to him. But later, at a lunch I told him, that I was particularly impressed 
by one special topic, namely his treatment of the anomaly of water between 0 
and 4 degrees, of which I never had seen an adequate presentation. He then said 
approvingly, “I can tell you that this subject was my special goal for writing this 
booklet”, and then taking his glass he invited me to call him henceforth “Clifford”, 
which from here on I shall also do in this discourse! 

But during his stay in Louvain-la-Neuve there was one other topic towards 
which many of our conversations were directed again and again. This was the 
Bernoulli Edition, to which I shall now turn. 
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5. Truesdell’s Contribution to the Restart of the Bernoulli Edition 

This other topic which occupied Clifford and me was precisely the new beginning 
of the Bernoulli Edition, and here I must now go back a few years. Clifford had 
been involved with the Euler Edition, as I mentioned already, through the math- 
ematician Andreas Speiser, an uncle of mine, with whom I had close contacts. 
My uncle was extremely proud of this acquisition and spoke to me often enthu- 
siastically about Clifford. He gave me separata of the two introductions to the 
hydrodynamical works, and my uncle’s enthusiasm caught on also with me. 

I made Clifford’s acquaintance in 1957 on the occasion of Euler’s 250th an- 
niversary, where at my uncles invitation he was the main speaker at the official 
university ceremony. A few years later he wrote to me a complimentary letter for 
my own introduction to Euler’s works in the domain of physical optics. Meanwhile 
I had begun to read his introductions, so that when J.O. Fleckenstein asked me to 
succeed Hans Straub as the editor of the works of Daniel Bernoulli, I said “Yes, 
but . . .”. Namely, I stated as a condition that I should be paid the equivalent of a half 
time assistant. It so happened that a few month's earlier a young student had asked 
me if she could write a Ph.D. thesis under my direction. I wished to accept her, 
for she had definitely “une tete bien organisee”, but no post seemed free. Hence 
my proposal to Fleckenstein. I succeeded in persuading the student to work on 
the history of science, although at first she found this a puzzling and somewhat 
dubious proposition, and Fleckenstein arranged the financial side. Today, some 
twenty-seven years later, this young student, Patricia Radelet-de Grave, is professor 
at the Universite Catholique de Louvain, where she teaches the history of science. 
She has now succeeded me as editor of the Bernoulli Edition while an Italian Ph.D. 
student of hers served as the Edition’s secretary. 

Studying Clifford’s introductions, I had become convinced that he was the best 
possible guide and counsellor for launching the whole enterprise again. In 1975, 
I was in New York when I received a call from Clifford who inquired about what 
was going on in the Bernoulli Edition. I gave him the little information I had, 
but only later did I find out that he had not been terribly excited by my answers. 
Nevertheless, later, as I shall mention, he accepted an invitation to be the edi- 
tor of Daniel Bernoulli’s work on hydrodynamics. To my surprise he wrote to 
me that he was not satisfied with his earlier work in the Euler Edition. His, as I 
noted before, was the only critical voice about these introductions, of which I ever 
heard. 

In 1980, when I succeeded Fleckenstein as the Editor of the whole Edition, 
he encouraged me to ask Andre Weil to become an editor of the works of Jacob 
Bernoulli. Weil accepted very kindly first the volume Analysis and then later also 
the volume Differential Geometry. Also from Weil I learned much on the ait of 
editing, and I wish to state here too that I cannot remember one difficult moment 
with him. A bit later Weil introduced me to Herman Goldstine, who edited the 
volume on Variational Calculus containing works of Jacob and of Johann. Mean- 
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while Mrs. Radelet and I produced plans for editing the works of all Bernoullis; 
so far there had not been any plans nor any reliable estimates of the work at 
all, the old ones being all much too low. Precisely all these questions and many 
more, including the choice of the typography and the design of the volumes, were 
then discussed with Clifford in Louvain-la-Neuve. He took a detailed interest in all 
problems and Mme. Radelet and I learned much from him. 

At last we could publish the plans, which were printed in 1982 in an illustrated 
brochure, which also contained a presentation of the Bernoulli family and the 
importance of each member. The plans consisted of (i) a presentation of the whole 
project, including what had already been achieved and the distribution of the works 
into volumes and (ii) a determination of our priorities: first to complete what was 
begun, i.e., Jacob I and Daniel Bernoulli and three volumes with the letters of 
Joh. IB.; this first stage is now approaching its completion. Then only to complete 
in a second stage all works of Joh. I and the “minor” B.'s; this stage is now opened 
with the deposition of Vol. 8 of the works of Joh. I by P. Villaggio from Pisa. 
A third stage was forseen for the letters; for their publication a project was worked 
out by F. Nagel and myself. Our plans also contained (iii) a list with the names of 
all editors of the first stage. At first, when I wrote to Clifford about this brochure, 
he did not seem terribly impressed, but when I sent him two copies he sent me his 
enthusiastic congratulations together with a list of friends and colleagues whom he 
invited me to send a copy too. 

These plans, with the appointment of editors for the first stage, were the basis 
of the 1982 restart of the Bernoulli Edition. The same year, on the occasion of 
the bicentennary of Daniel Bernoulli’s death, the Curatorium, under its president 
the historian A. Gasser, organized a Symposium. The main speaker was Clifford, 
who in the “Alte Aula” between the portraits of Daniel Bernoulli and Euler gave 
a speech about the research of both on the theory of oscillations, evaluating the 
strengths and the weaknesses of both of them. 

This was the beginning of a series of exchanges concerning the progress of the 
Edition, and especially of my requests of Clifford's opinion on various questions. 
At the Symposium we presented the first new volume edited by L. Bouckaert and 
B .L. van der Waerden. It received, besides several favorable reviews, one that put 
the Edition at its start into serious trouble. Again Clifford came to our rescue and 
refuted in a letter to the Swiss National Science Foundation (SNScF) all points, 
with the exception of one, of the review. His letter especially restored, as I was told, 
the confidence of the SNScF. Incidentally, the author of the review later graciously 
apologized to Mrs. Radelet. 

Of course, we had all very much hoped that Clifford might deposit the two 
volumes containing Daniel Bernoulli’s work on hydrodynamics. This was not to 
be. As all know, an enormous load of work kept him busy beyond his forces, and 
his health eventually failed him, although he was in the best of care. He had to 
resign from his engagement, and he advised me to invite the Russian Academician 
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Gleb Mikhailov to take over. I am happy to report that recently we began with the 
printing of the first of the two volumes! 

But, thanks to his friend Andre Weil, Clifford, nevertheless, became a Bernoulli 
Editor! Indeed Weil had advised me to edit completely all letters of Jacob Ber- 
noulli’s correspondence with Leibniz and I had unhesitatingly accepted his advice, 
ignoring the existence of an agreement between the Leibniz and the Bernoulli 
Edition, which had left the editing of the letters with Leibniz to her sister-edition 
in Hannover; these letters were, of course, the most interesting ones. But when I 
explained the situation to our colleagues of the Leibniz Edition, they very gener- 
ously accepted our plans, provided we would not undertake a “critical edition”. 
During his work on the edition Weil persuaded Clifford to write an introduction to 
the parts of the correspondence that dealt with questions of mechanics. It is there 
that Leibniz drew Bernoulli’s attention to the problem of the curved arc. The result 
of Weil’s invitation was again the appearence of a very penetrating introduction. 

Thus the Edition is fortunate that Clifford’s name will remain connected to it, 
and especially as an editor of Jacob Bernoulli, for whom he had done so much. But 
even without this turn of events, after 30 years of work for the Bernoulli Edition, 
I can state firmly, that no one has done more for making the new beginning of the 
Edition in 1982 possible than Clifford Truesdell. 



6. Truesdell the Artist and the Man 

My report on Truesdell’s work for the Euler and the Bernoulli Edition must end 
here, although I could go on at length. But there remains a question: neither Clif- 
ford’s scientific expertise nor his penetration into history alone can explain the 
full fascination which his works exert on their reader. We know that all scientific 
theories arc even at best only approximations to the observed world and the same, 
but even more so, holds for the reconstruction of the historical path on which they 
were founded. Clifford, more than most historians was aware of this. All too often 
we must be satisfied with guesses. Thus, like the scientific theories, the historical 
reconstruction to some extent always remains a construct. So, what then produces 
the great satisfaction that we experience when we read Truesdell’s works? 

Here, for discovering the answer we must, I think, turn to another field: we 
must enter another dimension - the realm of beauty. The man, who was so much 
attached to all aits, music, painting, old books, etc., was himself also an artist. 
He has composed his books, in the double sense of this word. As much as his 
search for scientific precision in all details would allow it, his books arc beautifully 
constructed! 

This brings me necessarily to Clifford the private man. Everyone who had the 
intense pleasure to be received in the Palazzetto knows what I mean: the carefully 
chosen objects of the collection, their carefully thought out presentation, and es- 
pecially their owners’ passionate interests in all aids and also in the history of the 
aids. Here too one could experience the truly enlightening comments which their 
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guests received. One could watch how they stimulated through their interest in all 
crafts the artisans of Baltimore who made the Palazzetto into what it became. As 
you realize, I slipped now, almost unconsciously, into the plural: the treasures of 
the Palazzetto were offered to its guests by a couple! And so this was more than 
only their home! 

Can we imagine Clifford’s tremendous outpourings without the constant and 
intense help of Charlotte? Can we imagine this without her painful proof reading 
of his books, her corrections, her meticulous improvements of the last details, her 
conscientious organization of the Archive as well as the classification of Clifford’s 
correspondence in a private archive? Of course we cannot. Just as the Palazzetto’s 
hospitality was the work of both, the Palazzetto’s soul was Clifford and Charlotte. 
And Charlotte made sure that in spite of his harsh afflictions Clifford could spend 
his last years there in dignity. For this, all of Clifford’s friends will always be in her 
debt, and they will remain grateful to Charlotte and Clifford. 
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In memoriam Clifford Truesdell 



1. Introduction 

A general version of a conservation law with one spatial variable .v is a system of 
partial differential equations of the form 

u r = g(u, s), + h(u, s) (1.1) 

where u is an n -tuple of unknown scalar functions of s and t: g and h are given 
functions with values in M" ; and derivatives are denoted by subscripts. Suppose that 
g is differentiable and that h is continuous. In this case, this system is hyperbolic if 
the matrix 3g/3u of partial derivatives of the components of g with respect to the 
components of u is positive-definite. As is well known, such nonlinear hyperbolic 
conservation laws admit shocks. In both analytic and numerical studies of (1.1), a 
central role has been played by modifying it by appending to its right-hand side 
an artificial viscosity term [D(u, s) ■ u v | v where D(u) is a small positive-definite 
matrix (often taken to be constant and diagonal. See [4], Here D • u v denotes the 
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image of the n -tuple u v under the matrix D.) For problems of continuum physics, 
the effect of D is to modify the material properties characterized by g and h. 

When we introduce such modifications into conservation laws from physics, 
it behooves us to determine if the modified system has physical significance. If 
not, the modification might introduce serious analytic or numerical errors. (This 
happens when some standard numerical methods for the treatment of shocks arc ap- 
plied to the equations for the planar motion of elastic rods [ 3 ].) When ( 1 . 1 ) is a set 
of equations of continuum mechanics, there arc three sources of difficulty: (i) Vec- 
torial geometry is suppressed because the unknown u is just an n -tuple of scalars, 
typically components of vectors with respect to some moving basis with the basis 
carrying most of the geometric information, (ii) The system typically includes both 
momentum equations and compatibility equations, the latter expressing the equal- 
ity of mixed s and t derivatives. It is necessary to determine what physical meaning, 
if any, inheres in adding dissipation to such compatibility equations. (Slemrod [ 5 ] 
first treated this question for a scalar equation, in which the issue of invariance 
under rigid motions does not arise.) (iii) The form of the artificial viscosity D • u s is 
inspired by the viscosity term for 1 -dimensional gas dynamics and more generally 
for that in the Navier-Stokes equations. These equations arc typically given in 
a spatial (Eulerian) formulation. This form of the dissipation is not preserved in a 
material (Lagrangian) formulation, typically used for problems of solid mechanics. 
Indeed, we shall see that in the material formulation, the artificial viscosity D • u s 
corresponds to constitutive equations that arc not invariant under rigid motions. 

The purpose of this paper is to resolve these difficulties, producing properly 
invariant dissipative mechanisms suggested by this artificial viscosity. We shall see 
that such dissipative terms have a rich and attractive structure. 



Notation. We employ Gibbs notation for vectors and tensors: Vectors, which arc 
elements of Euclidean 3 -space E 3 , and vector-valued functions arc denoted by 
lower-case, italic, bold-face symbols. The dot and cross products of (vectors) u 
and v arc denoted u ■ v and u x v. The value of tensor A at vector v is denoted A ■ v 
(in place of the more usual Av). Twice-repeated lower-case Latin indices except 
for the independent variables s and t arc summed from 1 to 3 and twice-repeated 
lower-case Greek indices arc summed from 1 to 2 . 

Triples (wi, w 2 , n 3) of components of any vector u with respect to a certain 
nonconstant right-handed orthonormal basis {dk} arc denoted by the corresponding 
lower-case, sans-serif, bold-face symbol u. In view of the orthonormality of this 
basis, we set u • v := UjVj (= u ■ v), |u| = ^/u^ul (= |«|), u x v := (112V3 — U3V2, 
u 3U1 — u 1U3, u 1V2 — u 2 V\) (so that the first component of u x v is (u x v) ■ d\, 
etc.). The matrix of a tensor A with respect to this basis is denoted A. Its action on 
a triple u is denoted A ■ u. The dot product and norm for other n -tuples arc treated 
analogously. 

The (Gateaux) differential of u i-> / («) at v in the direction h is (d/d t) f (v + 
th)\ t =o- When it is linear in h. we denote this differential by ( 3 / / 3 u)(v) ■ h or 
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fuiv) • h. We occasionally denote the function u i— /(«) by /(•)• The partial 
derivative of a function / with respect to a scalar argument t is denoted by either f, 
or 3 ,/. The operator 3, is assumed to apply only to the term immediately following 
it. We shall always use notation like d, for a total derivative, i.e., a derivative of a 
composite function. Obvious analogs of these notations will also be used. 

2, Formulation of the Governing Equations 

We briefly outline the formulation of geometrically exact equations governing the 
motion in space of a rod that can suffer flexure, extension, torsion, and shear. We 
follow [1, Chapter 8], which should be consulted for interpretations and for the 
proofs of all our assertions. 

The motion of a rod is defined here by three vector- valued functions 

[0, 1] x I 9 (j, () h r(j, (), d\(s,t), d 2 (s,t)e'E ? ' (2.1) 

with \d] ( s , t). d 2 (s, 0} orthonormal. The function r (•, t) may be interpreted as the 
configuration at time t of the curve of centroids of a slender 3-dimensional body. 
The vectors d\{s, t) and d 2 (s, t) may be interpreted as characterizing the orienta- 
tion of the material section at s at time t. In particular, d\ (s, t) and d 2 (s, t) may 
be regarded as characterizing the configurations at time t of a pair of orthogonal 
material lines of the section s. We assume that s is the arc-length parameter of the 
reference configuration of r and we scale the length so that 0 ^ s ^ 1 . We set 

d 3 := d\ x d 2 . (2.2) 

Since {d k (s, 0} is a right-handed orthonormal basis for E 3 for each ( 5 , t), there arc 
vector-valued functions u and w such that 

3 s d k = u x d k , d,d k = w x d k . (2.3) 

Since the basis \d k } is natural for the intrinsic description of deformation, we 
decompose relevant vector-valued functions with respect to it: 

v r s — v k d k , p := r, = p k d k , u = u k d k , w = w k d k . (2.4) 

The equality of mixed partial derivatives of r and of the d k implies that 

p s = v, = (d,v k )d k + w x v, w s = u, + u x w — (d,u k )d k . (2.5) 

We set 

u := («i, u 2 , w 3 ), v := (iq, v 2 , v 3 ), 

(2.6) 

p := (pi, p 2 , p 3 ), w := (w u w 2 , w 3 ). 

u and v are the strain variables corresponding to the motion (2.1). (The strains 
u\ and ii 2 measure flexure, u 3 measures torsion, iq and v 2 measure shear, and v 3 
measures dilatation.) 
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In the configuration at time t, the resultant contact force and contact couple 
exerted by the material of (5, 1] on the material of [0, 5] (for 0 < s ^ 1) are 
respectively denoted n (v , t) and m(s, t). Provided that there arc no body forces or 
couples, the equations of motion have the form 

n s + / = pAr tt , (2.7) 

m s + r s x n + / = d,(pJ pq w q d p ) =: d,(pj ■ w) (2.8) 

where (pA)(s) is the prescribed positive mass density per reference length at s, 
the (pj y s)(s), y, S = 1,2, arc the prescribed components of the positive-definite 
symmetric 2x2 matrix of mass-moments of inertia of the section s. The positive- 
definite symmetric 3x3 matrix p J := ( pJ pq ) is defined by pj y 3 = p/ 3}/ = 0, 
p J 33 — P Jyyi and p J . — p J pqdpdq . 

Let 

m k := m ■ d k , n k := n d k , 

m := (mi, m 2 , m 3 ), n := (n\, »2, H3). 

m\ and m 2 arc the bending couples, m 3 is the twisting couple, n \ and «2 arc the 
shear forces, and n ■ r s /\r s \ is the tension. 

The rod is elastic if there arc constitutive functions 

(u, v, s) m(u, v, s), n(u,v,s) (2.10a) 

such that 

mfi, t) = m(u(5, t), v(s, t), s), etc. (2.10b) 

This form of the constitutive equations ensures that the material behavior is invari- 
ant under rigid motions. 

For any function [0, 1] x R B (5, t) h* z(.v, t) we define z SJ (o, x) := z(s — cr, 
t — r) for all a such that 5 — a € [0, 1] and for all x f t. The general constitutive 
equations for a rod whose response at (5, t) depends nonlocally on other material 
points (sections) of the rod and depends upon the past history has the form 

m(s, t) — m(u s,r , \i s,t , s), etc. (2.11) 

These very general constitutive equations are likewise invariant under rigid mo- 
tions. 

We now recast our governing partial differential equations as a vectorial system 
of first order in the time derivative. Equations (2.3)-(2.5), (2. 10), (2.11) imply that 



d,d k = w x d k , (2.12a) 

u, = w s — u x w, (2.12b) 

v, = p s , (2.12c) 

3 t (pJ ■ w) = 3 s (m k d k ) + v x h k d k , (2.12d) 

pAp, = d s (h k d k ) (2.12e) 
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where the arguments of m k and h k arc 

u ■ di, v ■ di, w s di, p s ■ di — (w x v) ■ di, s. (2.12f) 

Note that the ordinary differential equation (2.12a) preserves the dot products d k di 
and therefore ensures that {d k (s, f)} is an orthonormal basis for all s, t if {d k (s, 0)} 
is an orthonormal basis for all s. By the continuation theory for ordinary differ- 
ential equations, the lineality of (2.12a) and the constancy of d\ ■ d\, . . . imply 
that the solutions of initial-value problems for (2.12a) arc defined for all t. The 
system (2.12) is hyperbolic if (m, n) satisfies the monotonicity condition that the 
matrix 

3(m, n) 

is positive-definite. (2.13) 

3(u, v) 

If we take the componential version of (2.12) with respect to the basis {d k }, we 
can uncouple the equation (2.12a) for the d k from the remaining equations: Let e k i, n 
denote the alternating symbol. Then (2.12a) has the componential form 



dtd k — € k j jW jd { , (2.14) 

and system (2.12b-f) is equivalent to 

d,Uj = d s Wi — eij k w /U k , (2.15a) 

9? Vi = d s pi + e ijk (uj p k - wjv k ), (2.15b) 

dt(pJijWj) = d s mi + e ijk (ujfn k + Vjh k - WjpJ kq w q ), (2.15c) 

3 ,(pApi) = d s hi + e ijk (ujh k - WjpAp k ), (2.15d) 

where the arguments of m k and h k arc 



u, v, w v — w x u (= u r ), p s + u x p — w x v (= v r ), 5 . (2.15e) 

We can write this system in the compact form 



u, = w s — w x u, (2.16a) 

v ; = ps f u x p — w x v, (2.16b) 

d,{pS • w) = pj • w, = 3sfn + uxm + vxn — wx (pj • w), (2.16c) 

3,(pAp) = pAp, = 3 S n + u x n — w x (pAp). (2.16d) 



(Even though (2.16) is independent of the d k , boundary conditions, which we do 
not study, may not be.) 

The systems (2.12) and (2.16) are in a general conservation form. It is important 
to note that our original system (2.3)-(2.5), (2.10), (2.11) of governing equations, 
the first-order vectorial system (2.12), and the first-order componential system 
(2.16) are each equivalent. 
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з. Artificial Viscosity 

We now modify system (2.16), which we identify with (1.1), by adding an artificial 
viscosity D • u s , where, for simplicity, we take D to be a constant positive-definite 
diagonal matrix. In particular, let U, V, W, P be constant positive-definite diagonal 
3x3 matrices. Then the modification of (2. 16) with artifical viscosity has the form 

u f = w s — w x u + U • u SJ , (3.1a) 

v, = p s + u x p - w x v + V m ss , (3. lb) 

pJ ■ w ; = 3 s m + uxm+vxn-wx (pJ • w) + W • (pJ ■ w s ) s , (3.1c) 
pAp r = 3 s n + uxn — wx (pAp) + P • (pAp. s ) s . (3. Id) 

Suppose we were to decompose u and v with respect to \d k } as above, but 
decompose w and p with respect to a Cartesian basis {i k \. It is a straightforward 
exercise to show that the modification of the resulting system with artificial viscos- 
ity as in (3.1) is not equivalent to (3.1) (and therefore these two systems could have 
solutions with very different properties). This is a portent of some of the difficulties 
we must overcome. 

Modification of the momentum equations. We would like the artificial viscosity 
terms in (3.1c,d) to represent a material dissipation, which would regularize the 
behavior of solutions. Thus they should modify the constitutive functions in these 
two equations. If these modified constitutive functions are to be invariant under 
rigid motions, then the remarks surrounding (2.11) imply that W • pJ ■ w s and 
P • p v should depend solely (although possibly nonlocally in space and time) on 

и, v. It follows from (2.5), which gives the actual kinematical relations, that these 
viscosities lack the requisite form. It is also clear - from (2.11) how to rectify this 
deficiency. A particularly simple way to do this is to drop W • pJ • w s and P • p v 
from the right-hand side s of (3. lc) and (3. Id) and to replace (m, n) of (2. 10b) with 
(m + M • u f , n + N • v r ) where M, N are constant positive-definite diagonal 3x3 
matrices. Then in place of (3.1c,d) we obtain 

pJ ■ w, = 3 s m + M • u„ + u x (m + M • u,) 

+ v x (n + N • v,) — w x (pJ ■ w), 
pAp r = 3 s n + N • M st + u x (n + l\l • v f ) — w x (pAp) 

These equations are equivalent to the following modifications of (2.12d,e): 

(pJ ■ w), = {[, n k (u , v) + M k id,ui\d k } s + v x [h k { u, v) + N u d t Vi\d k , (3.4) 
pAp, = {[« fc (u,v) + N k ,d t vi\d k } s . (3.5) 

Modification of the compatibility equations. We now examine modifications like 
(3.1a,b) of the compatibility equations (2.16a,b). Since such modifications are crit- 
ical for numerical methods, we cannot avoid studying them by simply setting 



(3.2) 

(3.3) 
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U = O = V, as we would do in the analysis of the differential equations for 
viscoelastic rods of strain-rate type. We first have to frame a notion of invariance 
for modified compatibility equations, which come from purely kinematic consid- 
erations and have no intrinsic material significance. We define an invariant system 
with artificial viscosity to be a system of equations with single time-derivatives on 
the left-hand side and with each equation containing a dissipative term such that 
it is equivalent to the system consisting of momentum equations (2.7), (2.8) and 
constitutive equations of the invariant form (2. 11). Thus to study this issue, we have 
to reconstitute the governing system of equations of motion in their traditional form 
involving second time-derivatives from a suitable modification of (2.16) involving 
first time-derivatives. In the process of constructing an invariant modification of 
(2.16a,b) we show that (3.1a,b) arc not invariant. 

Rather than modifying (3.1a,b), it is more convenient to modify (2.12a-c). Let 
U and V be tensors whose matrices with respect to the basis [clfi arc U and V. We 
first consider a modification of (2.12c) in the form 

v t = p s + [( pA)~ x V ■ v,|. s + [(Mr'i/L (3-6) 

where rj is a function at our disposal to make this equation invariant. Since this 
equation sets a / -derivative equal to an s -derivative on a simply-connected domain, 
there is a vector-valued potential, which is convenient not only to denote as r but 
also to treat as r, such that 

r t - p + (pA)~ l V ■ v s + (pA)~ l r), r s = v. (3.7) 

The treatment of the analogous modification of (2. 12b) is a little trickier: We 
seek a | so that 

u t = \w + ( pjy 1 • (U ■ u s + £)L - u X [mi + ( pjy 1 • ( U ■ u s + I)] (3.8) 

is invariant. Although this equation does not have the form of a /-derivative equal- 
ing an s -derivative, it does have the form of (2.5)2- We therefore conclude that there 
is a triple of vectors, which is convenient to denote as {d k }, such that 

3 tdk = [U! + (pjy 1 ■ (U ■ u s + I)] X d k , 3 s d k = u x d k . (3.9) 

Since each of these equations conserves the dot products {d k ■ d/}, we see that these 
{d k } arc orthonormal if they arc orthonormal at the initial time. Note that (3.9) i is 
a modification of (2.12a). 

We replace p in the modified momentum equation (3.5) with its expression 
coming from (3.7) and we now identify the d k appealing there with the new vectors 
satisfying (3.9). We obtain 

pAr tt = (V ■ v s ), + J h + {[n*(u, v) + N k id,v,]d k } s . (3.10) 

For this equation to have the requisite invariance, the first two terms on the right- 
hand side must have the form [n k d k ] s where the n k depend (possibly nonlocally) 




62 



S.S. ANTMAN 



on (u, v). (The first term on the right-hand side lacks this form because the time- 
differentiation of the base vectors given by (3.9)i introduces w-terms not of this 
form.) We make a particularly simple choice of by choosing rj so that 

(V ■ v s ) t + r), = (V kl d t v,d k ) s . (3.11) 

The choice (3.11) gives (3.10) an invariant form: 

pAr n = {[n k ( u, v) + (N kl + V u )d t v,]d k } s (3.12) 

(where we use (3.9) to compute the derivatives of the d k ). We differentiate (3.6) 
with respect to t and insert (3. 1 1) into the resulting equation to get 

v„ = p s , + [(pAr'(Vud t v,d k ) s L (3.13) 

where again we use (3.9) to compute the derivatives of the d k . 

The principal paid of the partial differential operator acting on v in this equation 
is 



Vrr - v • [(pA)“'v If ] s . (3.14) 

It is just a vectorial version of the heat operator on v ( . It is responsible for the 
dissipativity and it gives this equation a parabolic character and gives the entire 
system a parabolic-hyperbolic character; see Zheng [6]. Note that the d k in (3.13) 
depend upon §, which has not yet been fixed. 

We now make (3.8) invariant by the same process by which we made (3.6) 
invariant. Since (3.8) comes from (2.12b) by replacing w with w + ( pj)~ l ■ ( U ■ 
u s + §), we make this replacement in (3.4), use the modified constitutive functions 
from (3.12), and interpret the d k in (3.4) as the new d k : 



(. pJ w ), = (U ■ u s + %), + {[m k (u,\i) + M k id,ui]d k } s 



+ V x [ h k (u , v) + (N k , + V k i)d,vi\d k , 


(3.15) 


To put this into invariant form, we simply choose § so that 




(U • u s + £) r — [U k jdfU[d k ^ s . 


(3.16) 


Thus (3.15) becomes the invariant 




(pj-w), = {[m k (u,\i) + (M k , + U, q )d t u q ]d k } s 

+ v x [ h k (u , v) + (N k , + V kl )‘d t vi\d k , 


(3.17) 



where once again we use (3.9) to compute the derivatives of the d k . 

Equation (3.16) may be regarded as I i near ordinary differential equation for §: 

£ t — U k i\($tUidk)s ( 4 , u i d k ) t ] 

= U kl {(d,uiu - d s u,[w + ( pj)~ l ■ ( V ■ u s + £)]} x d k . 



(3.18) 
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(Had we replaced £ by pj ■ § in (3.8), then the left-hand side of (3.18) would 
be ( pj ■ £), = pJki(£idk) t , and (3.18) would not be linear in the components of 
£ because the /-derivative of df- in this expression would introduce another factor 
involving the §/). 

We regal'd the modified compatibility equations (3.8) and (3.18) as a system for 
u and §. We cannot eliminate § from (3.8) as we did r] from (3.6) in getting (3.13). 
(We may regard £ as an internal variable.) Nevertheless, if we differentiate (3.8) 
with respect to t and then substitute (3.16) into the resulting equation where pos- 
sible, we obtain an equation for which the principal part of the partial differential 
operator acting on u is 

u„ - (pJ ) -1 • U • u m , (3.19) 

which has the same character as (3.14). 



4. Comments 

It is clear that we could have replaced all the positive-definite diagonal matrices 
appealing in the above development with positive-definite symmetric matrices. 
(Numerical methods typically do not require even this sophistication.) Indeed, 
we could introduce artificial viscosity through nonlinear constitutive laws, which 
would just give another source to the quasilinearity of the governing system. In 
short, there is no unique way to introduce invariant versions of artificial viscosity. 
The following development shows that if we make a slight change in our proce- 
dure, then the alternative invariant versions of the momentum equations (3.4), (3.5) 
have higher derivatives with respect to s corresponding to constitutive functions 
depending on (u s , v j also. The resulting theories are of strain-gradient type. 

Consider the pair (2.12c,e) of equations. Suppose that the modification of (2.12e) 
is taken to have the form 

pAp, = [h k ( u, v)4L + P • ( pAp) ss + £ (4.1) 

where the components of P with the basis {dk) form a constant positive-definite 
diagonal matrix and where £ is at our disposal to make (4.1) invariant. We retain 
the modification (3.6) of the compatibility equation and replace the two visible p’s 
in (4.1) with the expression coming from (3.7) | . We choose r], to control the term 
( V ■ v s ), by again adopting (3.11), thus converting (4.1) to 

pAr„ = {[h k { u, v) + V k id t v,]d k } s + P ■ [pAr, - V ■ v s - q] ss + £ (4.2) 

where 



V (s, t) ■ v s (s , /) + n]{s, t) 



C(5, 0) • v s (s , 0) + r/(s, 0) 




V kl d,vi(s, x )d k (s, t) dr 



(4.3) 
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by (3.11). Since (3.11) only involves the /-derivative ov »/, we choose the sum of 
the first two initial terms on the right-hand side of (4.3) to vanish, whence 

V (s, t) ■ l I s (s, t) + Y)(s , t) 

= d s [V kl vi(s, t)d k (s, 0] - 3.s [ v kl v,(s, r )d t d k (s, r) dt. (4.4) 

J o 

We next choose £ to ensure that the resulting version of (4.1) is invariant, without 
annihilating the expression involving P . Omitting details, we find that this equation 
involves four s -derivatives of r . 

The modifications associated with the imposition of artificial viscosity in hyper- 
bolic conservation laws support numerical methods like the Lax-Friedrichs scheme 
and the upwind schemes [4]. As mentioned above, standard versions of these meth- 
ods, which do not correspond to invariant constitutive functions, can produce se- 
rious errors. The introduction of higher derivatives with respect to s represents, in 
the language of hyperbolic conservation laws, a dispersive regularization, versions 
of which support the Lax-Wendroff and Beam-Warming schemes [4], The treat- 
ment of (4.1) shows that the invariant introduction of artificial viscosity may also 
introduce some dispersive effects. 

Slemrod [5] first associated the artifical viscosity in the equation of mass conser- 
vation for 1 -dimensional gas dynamics with capillarity. In material coordinates, the 
equation for conservation of mass becomes a compatibility equation and capillarity 
corresponds to strain-gradient effects for solid mechanics. 

It seems likely that the introduction of (properly invariant) artificial viscosity 
into compatibility equations gives additional regularity to the solutions. Thus the 
analysis of the regularized system should have advantages that more than compen- 
sate for the additional complexity. 
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Abstract. Two constitutive models that are based on the classical non-Gaussian, Kuhn-Grim prob- 
ability distribution function are reviewed. It is shown that all chains of a network cell structure 
comprised of a finite number of identical chains in an affine deformation referred to principal axes 
may have the same invariant stretch, if and only if the chains are oriented initially along any of eight 
directions forming the diagonals of a unit cube. The 4-chain tetrahedral and the 8-chain cubic cell 
structures are familiar admissible models having this property. An easy derivation of the constitutive 
equation for the Wu and van der Giessen full-network model of initially identical chains arbitrarily 
oriented in the undeformed state is presented. The constitutive equations for the neo-Hookean model, 
the 3-chain model, and the equivalent 4- and 8-chain models are then derived from the Wu and van 
der Giessen equation. The squared chain stretch of an arbitrarily directed chain averaged over a unit 
sphere surrounding all chains radiating from a cross-link junction as its center is determined. An 
average-stretch, full-network constitutive equation is then derived by approximation of the Wu and 
van der Giessen equation. This result, though more general in that no special chain cell morphology 
is introduced, is the same as the constitutive equation for the 4- and 8-chain models. Some concluding 
remarks on extensions to amended models are presented. 
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1. Introduction 

Various models of rubber elasticity are based on the non-Gaussian statistical char- 
acterization of a network of randomly oriented, perfectly flexible molecular chains 
that occupy their most probable configurations in the natural, undeformed state. 
These models use as a measure of deformation the change in length of the end-to- 
end vector r between molecular cross-links; and an affine deformation assumption 
relates the chain stretch to the macroscopic stretch of the continuum. The config- 
urational entropy of a chain, and hence its strain energy, is determined by specifi- 
cation of a probability distribution function P (r) that derives from non-Gaussian 
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statistical theory, the distribution P (r) being a rough measure of the range of varia- 
tion among the very many possible configurations of a chain [1], Some early studies 
of such distribution functions for freely jointed chains are described by Treloar [2], 
the most widely used and simplest of which is due to Kuhn and Grim [3]. 

The question then focuses on how to model a network of a great many such 
chains to obtain a continuum theory that characterizes the mechanical response of 
isotropic rubberlike materials in finite extension. Consequently, a number of spe- 
cific network models have been proposed, including the James-Guth 3-chain [4], 
Flory-Rehner 4-chain [5], Arruda-Boyce 8-chain [6], and the Treloar [7], Treloar- 
Riding [8], and the Wu-van der Giessen [9] full-network models. The work by Tre- 
loar’s group focuses on simple uniaxial [7] and biaxial [8] deformations, whereas 
the Wu and van der Giessen [9, 10] result admits general three-dimensional defor- 
mation states. All of these non-Gaussian network models use the Kuhn-Grim [3] 
probability distribution function, which is essentially a first order approximation 
developed from Rayleigh’s exact Fourier integral representation [11], In an effort 
to adjust for inaccuracies introduced through approximations in the Kuhn-Grim 
function, Jernigan and Flory [12] proposed an amended form of the distribution 
function. This idea was explored recently by Zuniga and Beatty [13] in a study 
of several amended models of rubberlike materials. Details on these models and 
several additional references citing various phenomenological models, including 
the review article [14], may be found there. The Arruda-Boyce model, however, 
has proved to be the most successful in that it is mathematically simpler than 
others, it compares most favorably with experiments on the mechanical response 
of various elastomers under diverse loading conditions, and it requires determina- 
tion of only two well-defined material constants. Nevertheless, in comparison with 
experimental data, none of these theoretical models predict fully accurate material 
response for all deformations studied, the greatest variance generally occurring for 
equibiaxial extension. 

Here we study the Wu and van der Giessen [9, 10] full-network model. Their 
major result is a formidable, three-dimensional integral type of constitutive equa- 
tion for an incompressible and isotropic, hyperelastic rubberlike material whose 
microstructure is characterized by a full-network of initially identical chains ran- 
domly oriented in the undeformed state. In applications, however, they admit that 
their rule requires time intensive numerical computation, so no specific analytical 
results have been obtained. With their result in hand, we seek a simpler but general 
constitutive equation for a uniform full-network microstructure. First, we show 
that the constitutive equations for the classical neo-Hookean (Gaussian network) 
model [2], the James-Guth 3-chain model [4], and the Arruda-Boyce 8-chain 
model [6], or the equivalent Wang-Guth 4-chain model [11], may be derived from 
the general Wu and van der Giessen equation. It is then shown that the squared 
chain stretch of an arbitrarily directed chain averaged over a unit sphere is a certain 
function of the first principal invariant 7] (B) of the Cauchy-Green deformation 
tensor B. With the aid of this result, a general average-stretch, full-network consti- 
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tutive equation valid in every reference frame is obtained by approximation of the 
Wu and van der Giessen principal stress-stretch equation. The reduced equation, 
though more general in that no specific chain cell morphology is assumed, has pre- 
cisely the same form as the constitutive equation for the equivalent 4- and 8-chain 
models. The same average-stretch procedure may be applied to full-network mod- 
els characterized by certain amended distribution functions [13], all of which arc 
thus approximated by the same formal constitutive equation, but each having a 
different isotropic elastic response function. It is shown elsewhere [15] that parallel 
results may be derived for the back stress tensor in amorphous glassy polymers. 

2. Work of Deformation 

We begin with a sketch of some relations for uniform non-Gaussian networks of 
perfectly flexible chains whose end points occupy their most probable positions in 
the reference configuration [1-3, 1 1, 12]. The non-Gaussian statistical treatment of 
a single, freely jointed molecular chain model accounts for the finite extensibility of 
the end-to-end chain vector length r up to its ultimate, fully extended chain length 
r L = Nl, where N is the number of rigid links, each of length l. In its undeformed 
state, the mean end-to-end chain vector length is given by* r 0 = */Nl [1,2]. Hence, 
the fully extended, chain locking stretch is defined by X L = r L /r 0 = \f~N . It 
is useful to define the relative chain stretch X r as the ratio of the current chain 
vector length r = r chain to its fully extended length r L . In terms of the chain stretch 
7-chain = Ghain/h), we then have 

. Ghain 7- c ii a i n _ /O 1 \ 

rL VN 

and hence X r varies from N~ 1 ^ 2 in the undeformed state to the value 1 in the 
deformed, fully extended state: N~ ->/2 ^ ^ 1 , with X r — ► 0 as N — »■ oo. 

The entropy s = k In P(r) for a single randomly oriented, freely jointed molec- 
ular chain is defined in terms of a probability distribution function P(r ) depending 
on r, where k is the universal Boltzmann constant [1, 2], Kuhn and Grim [3] de- 
rived an approximate** non-Gaussian expression for P(r) that yields the following 



* Recent analysis [16] by computational simulations has shown that as a consequence of anneal- 
ing. that is, finding the mechanically equilibrated states of network chains, each of several unimodular 
networks has a mean undeformed end-to-end vector length that is roughly 10-20% smaller than the 
classical root mean square (rms) value rg = vTVZ [1,2], This revelation, however, is of no concern 
in our current study of non-Gaussian networks for which we retain the classical rms value. 

** This is essentially the first order approximation in a series representation of the Rayleigh distri- 
bution function discussed later in Section 7. See [1, Chapter VIII] for these and other details on the 
Kuhn-Grim function for freely jointed chains. An alternative exact derivation of the chain tension 
relation/ = (1/Nl) du>(k r )/dX r — (k@/l)f), concluded without proceeding by way of the entropy 
function, is provided by Weiner [17, p. 244-247] based on the stress ensemble viewpoint. See also 
[2, pp. 108-109] and the subsequent discussion there on the non-Gaussian network theory. 
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configurational entropy for a single, randomly oriented chain, 

P 



s = k 



c — fV I k r (3 + In — 



sinh ft 



wherein c is a constant and 
p = £~\k r ) 



(2.2) 



(2.3) 



is the inverse of the Langevin function £(P). Therefore, 

1 

k r = £(P) = coth/1 , (2.4) 

P 

where we recall the relative chain stretch (2.1). 

The work of deformation, the strain energy per chain, is given by w = — &s, 
in which 0 is the absolute temperature. Hence, for the Kuhn-Grim chain entropy 
function (2.2), the strain energy per chain is determined by 

w(k r ) = k@N^k r p + ln(^-J^yj -c*, (2.5) 

c* being a constant chosen so that the energy vanishes in the undeformed state. 

The relative chain stretch in an affine deformation of an arbitrarily directed 
chain is readily related to the macroscopic principal stretches of the continuum, 
which is considered incompressible. The total strain energy for a full-network 
model, however, will depend on the orientation, distribution, and concentration 
of all of the chains in the bulk material. We shall return to this farther on. For sim- 
plicity, however, this complication may be removed by the introduction of specific 
chain cell structures. To characterize these structures, let n denote the chain density, 
the number of freely jointed chains per unit volume of the bulk material. Suppose 
that the microstructure consists of an assembly of certain unit cells of p chains 
initially oriented in p distinct directions emanating from a cross-link and each 
having a different relative chain stretch k pr . Assuming that the chain density n p for 
the /;th directed set of chains is the same for each direction, we have n p — n/ p\ and 
the distribution of the network of chains is called homogeneous. Therefore, among 
n chains per unit volume of a homogeneous distribution, the contribution to the 
total strain energy from all chains in the pth direction is (n/ p)w(k pr ). Thus, each 
of the p distinct chains of the cell contributes an amount of energy w p = w(k pr ) 
to the total strain energy W, which is given by 



fi x — ' 

w = -y Wj. 



(2.6) 



Clearly, it is not necessary to include oppositely directed chains of a symmetric 
cell structure; these have the same stretch and contribute the same energy, so p 
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may be replaced by p/2. If all chains in the cell should have the same relative 
stretch X 7 > = then Xa=i w j = P W (K)- In this case, the total strain energy per 
unit volume for a homogeneous network of non-Gaussian chains is provided by 

W(X r ) = nw(X r ). (2.7) 

We shall see momentarily that the 3-chain and 8 -chain cell structures, for example, 
arc respectively characterized by (2.6) and (2.7). 

We recall that the Gaussian (neo-Hookean) theory assumes that a molecular 
chain adopts a tight configuration with end-to-end separation r that is small com- 
pared to its fully extended length (i.e., for A cha j n \f~N). Therefore, the results for 
this model arc limited to moderate stretches for which the approximation of (2.3) 
is given by 

£«3A r . (2.8) 

It follows that the non-Gaussian, Kuhn-Grim energy function (2.5) reduces to 
the Gaussian strain energy function per chain. The variance encountered between 
predictions of the Gaussian theory and experiments at moderate strains, there- 
fore, very likely will not be significantly diminished by any non-Gaussian network 
model based on the Kuhn-Grim distribution function, as remarked by Trcloar and 
Riding [ 8 ], 

The constitutive equations for several models of interest arc sketched below. 
First, however, let us note that the strain energy function for an incompressible, 
isotropic hyperelastic material is a symmetric function W = W(X\, A 2 , A. 3 ) of 
the principal stretches Xj, subject to the incompressibility constraint A 1 A 2/.3 = 1. 
Then the stress-stretch equations for the principal Cauchy stress components 7) arc 
provided by 

Tj = —p + XjWj, j = 1, 2, 3 (no sum), (2.9) 

where p is an arbitrary pressure and Wj = dW/dXj. 



3. The James-Guth 3-Chain Model 

The James-Guth 3-chain model [4] considers a homogeneous full-network of n 
chains per unit volume oriented along the three mutually orthogonal principal 
directions of deformation, all having the same initial (ms) chain vector length 
r 0 = yfNll. Here ZV 3 denotes the number of rigid links, each of length Z, in a 
molecular chain between cross-links of the network. In an affine deformation the 
current chain vector length in the / th direction is then defined by r /c i lam = A ; r 0 , 
where X/ denotes the macroscopic principal stretch of the continuum along the j th 
principal axis. The corresponding jth relative chain stretch Xj r , in accordance with 
( 2 . 1 ), is defined by 

. X j chain Xj 

jr = = 7m’ 



j = 1,2,3, 



(3.1) 
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in which A /cha j n = r ;cha i n /r 0 = A ; - defines the / th current chain stretch and X L is 
the chain locking stretch. Since the 6-chain cell structure is symmetric and the 
distribution of chains is homogeneous, the chain density rij for the /th set of 
orthogonal chains is the same for each set: n ; = n/3, though w ; is not. Hence, 
from (2.5) and (2.6) in which p = 3, the total strain energy W = W{X\, A 2 , A 3 ) 
per unit volume for the James-Guth 3-chain network model may be written as 



IV = 



nk@ 



Nt, yU^r + In 

7=1 ' 




(3.2) 



in which the constant c 3 is chosen so that the strain energy vanishes in the unde- 
formed state; and, by (2.3), fij = £~ ] (A ;r ). Observing that XjdW/dXj = Xj r d W / 
3 Xj r and using (3.2) in (2.9), we obtain the constitutive equation for the James- 
Guth 3-chain model in the principal reference system : 

Tj = — p + —N^fiXjr, j = 1,2,3 (no sum), (3.3) 

3 

wherein /x 0 is the shear modulus in the undeformed state: 



/x o = nk&. (3.4) 

The non-Gaussian stress-stretch relations (3.3) arc determined by two parameters: 
the shear modulus no, and the number of links iV 3 in a single chain of the 3-chain 
network model. The latter controls the stiffening behavior of rubber materials at 
large strains and determines the ultimate extensibility of the network. 

With the aid of the infinite series expansion [2, 7] for £~ l (Xj r ) and use of 
the Cayley-Hamilton theorem for which B 3 = 7|B 2 — 7 2 B + 1 for an incom- 
pressible material, noting that the left Cauchy-Green deformation tensor B = 
diag[7,/. A 2 , A 2 ] in the principal reference system, we see that (3.3) may be cast 
in the familial - invariant tensorial form 

T = —pi + Wu 7 2 ; N 3 ) B + K 2 (7 lf 7 2 ; (V 3 )B 2 , (3.5) 

where the elastic response functions ^„ ( / 1 , 7 2 ; N 3 ) are defined by the infinite series 

Hi(7 1 ,7 2 ;7V 3 ) 
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A si mi lar series of terms has been absorbed in the arbitrary pressure term which 
is simply rewritten as p. It is evident that, except for its invariant tensorial rep- 
resentation in the form (3.5), there is little significant advantage gained over the 
principal axis representation (3.3), except possibly for finite element applications. 
By (2.8), for small to moderate stretches ft j ~ 3 A ; >, and (3.3) reduces easily to the 
neo-Hookean constitutive model [2] : 

T = -pl + /x 0 B. (3.7) 



4. The Arruda-Boyce 8-Chain Model 



We next develop the constitutive equation for the Arruda-Boyce 8-chain model. We 
begin with a different view point and thereby obtain a new auxiliary result. 

Let us consider a single, perfectly flexible and freely jointed molecular chain 
whose undeformed (mis) chain vector length is r 0 = 1 \f~N and whose chain vector 
r 0 = ( X , Y. Z) in the reference configuration is directed along the line X = Y = Z 
through O in a rectangular Cartesian frame 7/ = {O; C). Then, in the unde- 
formed network, the chain vector has length r 0 = Xy/3 and its direction in \jj 
is m = (1, 1, 1)/V3. (See Figure 1(a)). Now suppose that the network is subjected 
to an affine deformation in which \j/ coincides with the local principal axes. The 
corresponding principal stretches arc denoted by 7, ; and the squared stretch of a 
chain initially oriented in an arbitrary referential direction m is determined by 



3 

^ chain = m Cm = C 4 - 1 ) 

k = I 

where C is the right Cauchy-Green deformation tensor. Hence, for our special 
single chain model, (4.1) yields the chain stretch 



k chain — 



T chain 

ro 




(4.2) 



in which 7) = 7q + + Ay We recall that 7) is the first principal invariant of 

the Cauchy-Green deformation tensor C or B, each being equal to diag[Aj, \ 
in its corresponding principal reference system. The same result holds for a chain 
whose end point is initially situated along any of the three similar lines —X = 
Y = Z: —X = —Y = Z; and X = —Y = Z. Hence, (4.2) is a necessary condition 
in order that a chain may be initially oriented along any of the aforementioned eight 
directions radiating from O in the principal referential frame i//. 

Let us consider the converse question. What arc the orientations of all chains 
whose chain stretch is given by (4.2)? Consider any chain G whose end points 
in the principal frame 7/ arc at O and r 0 = r 0 m k l k initially, where m k arc its 
direction cosines. In its deformed state in 7/. the squared stretch of the chain G 
with direction m is determined by (4. 1), that is, by 

^ chain = A m \ + A m \ + ( 4 - 3 ) 
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Figure 1. Various chain cell models having the same chain stretch, (a) 1-chain structure, 
(b) 4-chain tetrahedral structure, (c) 8-chain cubic structure, (d) 4-chain semi-octahedral and 
8-chain octahedral structures. 

Now suppose that the same network is subjected to an affine deformation with the 
same values of the principal stretches as before, but with Ai and X 2 interchanged. 
The chain G now has the squared stretch 

^ chain = ( 4 - 4 ) 

for the same initial vector r 0 and the same values of the stretches X k . Of course, the 
interchange of stretches does not alter (4.2); so, if the chain stretch in both (4.3) 
and (4.4) is the invariant chain stretch (4.2), then 

(X\ - X 2 2 )(m\ - m 2 2 ) = 0 (4.5) 

must hold for arbitrary principal stretches X k that satisfy the incompressibility 
condition. It follows that m\ = m\ must hold initially for the chain G. Clearly, 
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a relation similar to (4.5) holds when the same network is again subjected to a 
deformation with the same values of the principal stretches as before, but with 
A.) and A. 3 , or X 2 and A, 3 interchanged. Consequently, m\ = m\ = m 3 = 1/3 
must hold initially for the chain vector r 0 . It thus follows that the only chain 
orientations for which (4.2) holds for all deformations with arbitrary principal 
stretches X k are chains situated along straight lines through the origin and defined 
by X 2 = Y 2 = Z 2 in i//, that is, the four lines described earlier and directed along 
the diagonals of a unit cube. We thus have the following auxiliary result. 

The chain stretch A. C h a i n in an affine deformation with local principal stretches X k 
has the invariant form (4.2) if and only if the chain is oriented initially in any of 
the eight directions from O along the diagonals of a unit cube. 

As a consequence, we have only a few possible network cell structures having a 
finite number of chain orientations for which (4.2) holds. These include the 1-chain 
structure, the 4-chain tetrahedral structure, and the 8-chain cubic structure shown 
in Figure 1, in which the origin O in Figures 1(b) and (c) is at the center of the 
cube. The 4-chain semi -octahedral and 8-chain octahedral structures in Figure 1(d) 
arc identical to the cubic structure. The 4-chain and 8-chain models, therefore, arc 
important special members of the larger geometrical class of uniform polyhedra 
networks having v chains, all with initial vector length r 0 . In the deformed state, 
a uniform polyhedron chain structure is distorted with varying degrees of rela- 
tive chain stretch among its many chains. The 4- and 8-chain network models, 
however, arc unique among these. They arc the only uniform polyhedra chain 
morphologies all of whose chains in the principal reference system have the same 
chain stretch (4.2), and hence the same strain energy per chain. Therefore, these 
geometrically similar models arc said to be isomorphic.* 

Equation (4.2) is the same as the Arruda-Boyce equation (16) in [6] obtained 
for a network model based on eight chains of undeformed vector length r 0 = l\fN% 
linked at the center of a cube and extending to its eight comers, As denoting the 
number of chain links** of the 8-chain model. Henceforward, in regards to this 
model, we shall write N = As and refer to our subsequent constitutive equation as 
the Arruda-Boyce 8-chain model. 

With (4.2), the relative chain stretch (2.1) is given by 



A. y 




(4.6) 



* This is viewed somewhat differently by Yeoh and Fleming [18]. According to them, the consti- 
tutive coincidence of the 4- and 8-chain models arises because the 8-chain model is isomorphic to a 
body centered cubic lattice comprised of two tetrahedral diamond sublattices; and the 4-chain model 
is isomorphic to a diamond lattice. 

** Wu and van Giessen [9] observed from simple extension data that A 3 ^ 3/Vg; so, experimentally 
determined values for A 3 and Ag for the same physical parameter N may vary considerably. See also 
the remarks in [13]. 
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where X L = y/N%, the fully extended chain stretch. All eight chains of the symmet- 
ric 8-chain network, in an affine deformation of its cubic structure in the principal 
frame i /r, have the same relative chain stretch (4.6). We shall assume that the net- 
work chain density n is distributed uniformly among these eight (or by symmetry, 
four) chain directions. Hence, by (2.5) and (2.7), the total strain energy per unit 
volume for the Arruda-Boyce 8-chain network model is 



W = [IqNs 



+ In 




(4.7) 



where c§ is a convenient constant, f> is defined by (2.3), and we recall (3.4). There- 
fore, the strain energy for the Arruda-Boyce model is simply a function of the 
principal invariant I\ alone. Hence, substitution of (4.7) into (2.9) yields* the con- 
stitutive equation for the Arruda-Boyce 8-chain network model [6, 19]: 



T = —pi + K(/i)B, 



(4.8) 



where B = diag[Aj, /,(, A.| ] in the principal reference system of the deformed state 
and the material response function K(/i) is defined by 



Wr) 



M- of 
3X r ' 



(4.9) 



with ft = X,~ l (k r ) and given by (4.6). 

We see that the invariant 8-chain rule (4.8) with the single elastic response 
function (4.9) is far - simpler than the corresponding 3-chain result in (3.5). For small 
to moderate values of the relative chain stretch (4.6) for which (2.8) holds, (4.9) 
yields K(/i) = /x 0 , a constant; and (4.8) reduces to the neo-Hookean (Gaussian 
network) relation (3.7), the same moderate principal stretch relation obtained from 
the 3 -chain model. 



5. The Wu and van der Giessen Full-Network Model 

The development of special network models, such as the 3-chain and 8-chain cell 
structures, avoids the more difficult consideration of random chain orientations 
studied by Wu and van der Giessen [9, 10]. A much simpler construction of their 
major result is presented next, and some basic applications follow. 

Let us consider a network cell of randomly oriented molecular - chains radiating 
from an arbitrarily chosen cross-link junction O. The other end point of each chain 
is similarly connected with other randomly oriented network chains. For a homo- 
geneous network, every network chain has the same initial, unstretched end-to-end 

* The result (4.8) in terms of principal stretches was first reported without demonstration by Wang 
and Guth [1 1, 13] for the distinct Flory-Rehner 4-chain tetrahedral model [5]. Arruda and Boyce [6] 
subsequently derived the same principal stretch result for the cubic 8-chain cell model and studied it 
extensively in experiments on a variety of rubber materials. The constitutive equation (4.8), however, 
holds in every reference system, a useful property noted in another context in [19]. See also [20], 
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chain vector length r 0 = l*/N. Consequently, each chain emanating from O has 
its end point on the surface of a sphere S of radius r 0 and volume v = 4nr^ /3. The 
volume averaged value of the strain energy per chain for the uniform network of 
chains enclosed within S is defined by 




w(X r ) du, 



(5.1) 



in which we recall the Kuhn-Grim strain energy (2.5) for a single randomly ori- 
ented molecular chain. The end point of a typical chain initially at (X, Y, Z) in a 
referential Cartesian frame <p = {O; I/,.} has the spherical coordinates (r 0 , 0o, 0 O ) 
with 0 O = [0, 7r], 0 o = [0, 2tt]. Hence, the total strain energy W = nw, per unit 
referential volume of a uniform, full-network of chain density n, by (5.1), is given 
by 



n I<2 tt ,n 

W = — / / w(X r ) sin 0 O d0o d0 o . (5.2) 

4tt J o Jo 

This is equivalent to integrating (5.1) over a unit sphere. Though here derived 
differently, (5.2) is the same constitutive equation obtained by Wu and van der 
Giessen [9] for an initially homogeneous distribution of a large number of ran- 
domly oriented chains, an equation introduced earlier by Trcloar and Riding [8], 
The initial end-to-end chain vector r 0 = r 0 m in <p has the direction cosines 



{m,(0o,0o)} = (sin 0 O cos 0o, sin 0 O sin 0o, cos 0 O ); 



(5.3) 



and the relative stretch = A, (0 O , 0o; A.* r ) of a randomly oriented chain, defined 
by (2.1), is determined by 

3 

X; = A, 2 (0o, 0o ; X kr ) = m 2 k X 2 kr . (5.4) 

*=1 

The relative principal stretches X^ arc defined by 

hr = ( 5 ‘ 5 ) 

as in (3.1). These arc independent of the chain orientation angles 0 O , 0 O . Thus, by 
(2.9) and (5.2), the Wu and van der Giessen constitutive equation for the Cauchy 
stress may be concisely written as 

T k = — p + ^ k X 2 k , k = 1,2,3 (no sum), (5.6) 

in which the elastic response functions = tCG-i , X 2 , Xj) arc defined by 

1 f 2n ml 

X k = —Ho / P(h)~r* - sm0 o d0 o d0 o , * = 1,2,3, 

J o Jo 'V 

wherein we recall (5.4) and the shear modulus /xq is given by (3.4). 



(5.7) 




76 



M.F. BEATTY 



One might expect that the full-network model (5.6) should provide better pre- 
dictions of observed experimental data than other approximate network models. 
But Wu and van der Giessen [9] do not find significantly better predictions than 
those demonstrated by the 8 -chain model, which they attribute to factors other 
than its superiority. In applications, however, the formidable constitutive equa- 
tion (5.6) must be solved by time intensive numerical methods [9], To get around 
this computational difficulty, Wu and van der Giessen introduce an ad hoc phenom- 
enological constitutive equation consisting of an additive mixture of 3-chain and 
8 -chain constitutive components with coefficients chosen to best fit their general 
constitutive equation. They thus use this somewhat simpler composite phenom- 
enological model in numerical applications in which N has the same value for 
both contributions. We recall, however, that for the separate models experiments 
exhibit distinct values for /V 3 and iV 8 . While their composite mixture rule is com- 
putationally simpler and easier to use in comparison of model results with test 
data, it replaces an amorphous structure with specific chain cell morphologies 
and it offers no significant analytical simplicity. (See also [13].) Consequently, we 
seek a general, but approximate constitutive equation that may be derived directly 
from (5.6). First, however, we shall show that three familial - special cases may be 
readily derived from this full-network equation. 

5.1. THE GAUSSIAN NETWORK MODEL 

The Gaussian network is characterized by the moderate stretch approximation (2.8). 
It thus follows that the response functions (5.7) are constants: 

3/r-o 

N* = — 1*. k= 1,2,3, (5.8) 

47T 

where 

n 7T 

m 2 k sin 9 0 d0 (} d</> 0 . (5.9) 

With the aid of (5.3) in (5.9), we obtain = i 2 = I 3 = 4zr /3; and hence each 
K/, = /zo, a constant. Therefore, the full-network model (5.6) yields the familiar 
constitutive equation (3.7) for the neo-Hookean, Gaussian network model. 

5.2. THE JAMES-GUTH 3-CHAIN MODEL 

The principal stress components for the James-Guth 3-chain model depend only 
on the homogeneous distribution of chains with density n k = n/3 in each of the 
corresponding three orthogonal principal directions, the chains in opposite direc- 
tions being equivalent. Therefore, in (5.7) we replace n with n/ 3; that is, in view 
of (3.4), iM) is replaced with /x 0 /3. In addition, because each principal direction 
corresponds to a different relative chain stretch, for the kth principal direction 
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we replace A,. with k kr in accordance with (3.1); and each of the three directed 
contributions must be taken into account. Because the chains arc oriented initially 
along the three principal directions I,, the three chain directors m, = in j k \ k arc 
the constant vectors mi = (1, 0, 0), m 2 = (0, 1, 0), and m 3 = (0, 0, 1), in accord 
with (5.3). Consequently, (5.7) is now written as 

j 1*2 7T f*Jl ' jjj - 

= — Mo/ / & i sin Oo d#o d(/> 0 , * = 1,2,3, (5.10) 

12 7t Jo JO ^ kjr 

in which [ij = X ~ 1 (A 7 > ) . Noting that m j k is the kth component of the j th principal 
chain director, we see for k = 1 that Pj m2 ji/^jr — An, for example. It is 
then easily seen that (5.10) yields 

K = ^ A k = 1,2,3. (5.11) 

3 Air 

Recalling (3.1), we have = N$k\ r , and hence with (5.11) we find that the con- 
stitutive equation (5.6) reduces to (3.3) for the non-Gaussian, James-Guth 3-chain 
model. 



5.3. THE ARRUDA-BOYCE 8-CHAIN MODEL 



The Arruda-Boyce model consists of a homogeneous distribution of identical 
chains with density n k = n/ 8 in each of the eight chain directions oriented along 
the diagonals of a cubic cell, so /x 0 is replaced with [jl 0 /8. (Because chains in 
opposite directions arc equivalent, actually we need only consider distributions 
of chains along the four diagonal directions.) All chains have the same squared 
direction cosines m\ \ and all experience the same relative chain stretch in an affine 
deformation in the principal frame aligned with the edges of the referential cube. 
Therefore, it turns out that the effect of our accounting for these eight identical 
contributions in the manner described previously for the 3-chain model is simply 
equivalent to our considering a homogeneous distribution of single chains with 
chain density n and having the direction triple 



{m k } = 



( 1 , 1 , 1 ) 

V3 



(5.12) 



Then, with (5.5), we see that (5.4) yields A : 2 = (l/(3tVg))(A^ + )3, + A 2 ), i.e. the 
same relative stretch given in (4.6). Therefore, /l(A r )/A r is now independent of the 
coordinates 6q, 4>o~ and hence with the aid of (5.12) the response functions in (5.7) 
may be written as 






A oP 

3k r 



(5.13) 
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We see from (5.13) that the all three functions = K(/i) coincide with (4.9); and 
(5.6) thus reduces to (4.8) for the Arruda-Boyce 8-chain model [6]. A parallel ar- 
gument holds for the distinct Flory-Rehner 4-chain model [5] shown in Figure 1(b) 
and again leads to (4.8), first recorded in terms of principal stretches and without 
proof by Wang and Guth [11]. 



6. An Average-Stretch Non-Gaussian Full-Network Model 



Let us return to our homogeneous full-network unit of randomly oriented molecular 
chains of density n radiating from an arbitrarily chosen cross-link junction at the 
origin of a principal reference frame <p = {0; I*}. Each chain diverging from O 
has its end point on the surface of a sphere S of radius r 0 . In terms of spherical 
coordinates, the end point r 0 = / 0 m of a representative chain has the initial direc- 
tion cosines given by (5.3) in <p. When the continuum is subjected to a deformation 
with local principal stretches A* in (p, the same end point in an affine deformation 
has the end-to-end chain vector length r cha j n ; and its corresponding squared chain 
stretch (4.1) may be written as 

r 2 . 

^ chain = = sin2 #o(Ai COS 2 0 O + A? sin 2 </>()) + A 5 cos 2 0 Q . (6.1) 

r o . . . 

Notice that upon forming the ratio A 2 = A 2 hain /A, the result (6.1) is the same 
as (5.4) in which we recall (5.3) for a chain having an arbitrary orientation in tp. 

The volume averaged value of the squared stretch of an arbitrarily directed chain 
within a sphere S of radius r 0 centered at a cross-link junction is defined by 

^chain ~ f ^chain^ - (6-2) 

v J-8 

In terms of spherical coordinates, (6.2) becomes 

^ pjz 

^chain = ^J J 0 ^c 2 hain sin 0od0od(/)o, (6.3) 

which is equivalent to averaging the squared stretch over a unit sphere at O. Hence, 
for the squared stretch (6.1) of a typical arbitrarily directed chain in the full- 
network structure surrounding the central junction at O, we find eventually from 
(6.3) the following simple, invariant relation for the averaged chain stretch*: 



'-chain — 




(6.4) 



* Note added in proof: I have discovered recently that Kearsley [24] apparently was the first to 
have derived this rule. In addition, he shows [24] that the square of the stretch ratio of a material 
area element averaged over all orientations of its normal vector is equal to 72/3. A simpler proof 
this result follows readily from the relation or = l^Q ~ 'm • m. given in [25, Equation (29.2)]. in 
which a denotes the areal stretch ratio a — da/dA of the deformed material area element da(x) to 
its undeformed element dA(X) whose outward directed unit normal vector at X is m. The average 
of o' 2 over a unit sphere with radius vector m is defined by a 2 = 1/ (4rt) / 0 71 Jq or sin 9q dSq d</>o, 
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in which f =trB =X\+X\+Xy The average end-to-end chain vector length is thus 
defined by r chain = A chain r 0 , where r 0 = l\f~N , as usual. The locking chain length 
is r L = Nl, and hence the limiting chain stretch is /./ = r L /r 0 = «/N. Therefore, 
the mean relative chain stretch X r is defined by 






^ chain 
F L 



7-chain 

Vn 




(6.5) 



The italicized auxiliary result below (4.5) shows that a single network chain has 
the relative chain stretch (6.5) when and only when that chain is specifically ori- 
ented along a diagonal of a unit cube. The 4- and 8-chain network models arc the 
only uniform polyhedra chain morphologies all of whose chains in the principal 
reference system have the same relative chain stretch (4.2). The mean result (6.5), 
however, is more general in that no specific chain cell morphology is introduced, it 
holds in the mean for any single randomly oriented chain. 

In the affine deformation, the stretch of some chains in S clearly will exceed the 
average value (6.4), while that of others will be smaller, the nature of the macro- 
scopic deformation being captured by the invariant Tf. Recalling that (5.9) with 
(5.3) yields the constant value i* = 47r/3 and introducing the mean value (6.5) 
in (5.7) for the full-network model, we obtain for the elastic response functions 
the approximate values R* = R^(ki, X 2 , k 2 ) defined by the single relation 






/H) f(X r ) 

"3 IT" 



HoPiK) 

3X r 




(6.6) 



With this estimate in hand, the general constitutive equation (5.6) for our average- 
stretch, full-network model for rubber elasticity is nicely approximated by the 
invariant relation 



T = -pl+M/OB, (6.7) 

in which, with (6.5), the shear response function /x(/|) = K/.(Ai, Xi, Xf) is given 

by 



iMf) = 



(X r ) 
3X r 



(6.8) 



It is remarkable that equation (6.7) is precisely the same as the constitutive 
equation (4.8) for the Arruda-Boyce 8-chain model. The fundamental difference is 
that the result holds more generally for an average-stretch, full-network model of 
n randomly oriented chains per unit volume; it thus characterizes the amorphous 

for all orientations m given by (5.3) in the principal frame of C. It is evident that C -I m • m may 

r\ O 

be read from the right-hand side of (6. 1) in which the Xj. are replaced by , the principal values 
of C -1 . The subsequent integration for of, therefore, yields a rule similar to (6.4). In consequence, 
we have of = / 3 CCf/j fC - 1 ) /3 = /2(C)/3, which is Kearsley’s result. 
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molecular structure of rubberlike materials. Therefore, it is not necessary to empha- 
size the heuristic 8-chain cell structure in reference to their result. Equation (6.7) 
with (6.8) is simply the Arruda-Boyce constitutive equation for an average-stretch, 
full-network of arbitrarily oriented molecular chains. 

Alternatively, we may begin with the Kuhn-Grim configurational entropy (2.2) 
for a single, freely jointed and randomly oriented molecular chain in which we 
introduce the mean relative st retell (6.5) to obtain the mean entropy s = s(X r ), per 
chain. The mean strain energy per chain is then given by w = — 05 in accord with 
(2.5); and (2.7) provides the mean total strain energy W = nw for a homogeneous 
full-network of n such chains per unit volume. We thus obtain with (6.5) a mean 
strain energy per unit volume W = W (I \), namely, 

W(h) = nw(X r ) (6.9) 

depending on only the first principal invariant of B; and, with the aid of (2.9), the 
general constitutive equation for the incompressible elastomer is given by (6.7) 
with (6.8). 



7, A General Non-Gaussian Network Constitutive Equation 

The average-stretch procedure may be applied to amended non-Gaussian mole- 
cular network models based on the Wang and Guth [11] series developments of 
Rayleigh’s exact, but formidable integral formulation of non-Gaussian chain dis- 
tributions [1, p. 314]. Their relation for A » 1 and r comparable to A, charac- 
terizes highly elastic materials and leads to a probability distribution function [11, 
equation (2.20)] that depends on only the fractional extension ratio; namely, 



/V(V) = 



1 



( 4 ) 



1/2 r 



4tt Nl 3 \j tN / 

l 

X. y 



sinh fi 



1 -Xi- 



|_/3exp(A,/3) 

2A \- 1/2 ir 

7 



i + 



<?(4; A) 
A 



+ 



(7.1) 



in which X r and ft arc defined in (2.1) and (2.3), and q (X r ; A ) is a certain function 
of X r and the chain parameter A. This development is valid for all X r e [A -1 / 2 , 1]. 

We arc mainly interested in In P N (X r ). We thus see that the first square bracket 
in (7.1) gives the Kuhn-Grim distribution that leads to the configurational en- 
tropy (2.2) and the strain energy function (2.5). This term together with the first 
term within the second square bracket in (7.1) yields the amended Kuhn-Grim 
function introduced by Jemigan and Flory [12], and recently studied by Zuniga and 
Beatty [13]. The remaining terms become increasingly negligible for sufficiently 
large A. In any such series formulation, however, we shall have some strain energy 
function w(X r ), per chain, that depends only on the relative chain stretch; and for 
all such models the constitutive equation for a uniform full-network of randomly 
oriented chains is given by (5.2). But use of the general distribution in (5.2) further 
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complicates an already difficult equation. On the other hand, our average-stretch, 
full-network model for randomly oriented chains considerably simplifies the gen- 
eral distribution function in that, by (7.1), we always have P^(X r ) = P(p)\ and for 
a homogeneous network of chain density n, we may write the mean strain energy 
per unit volume as W = —nk@ In P(I\) = W( p). The most general form of the 
constitutive equation for every model characterized by (7.1) evaluated for X r is 
thus given by (6.7) in which the general shear response function, in accordance 
with (2.9), is determined by 



iMP) = 



i dw(i r ) 

3NX r dX r 



(7.2) 



where we recall (6.5). 

It is seen from (6.3) that as A^ hain — > N, the greatest value of the squared chain 

/V _ /V 

stretch, A- hain — > N also; and hence X r — > 1. Consequently, from (6.5), in any 
given affine deformation the first principal invariant f — > /,„ , its greatest possible 
value determined by the material constant N: that is, 



7 m = 3 N (7.3) 

is a material constant reflecting the locking stretch X L = \J~N of any randomly 
oriented molecular chain and is thus named the network locking constant. It follows 
that for an average-stretch model the first principal invariant f in every affine 
deformation is bounded by the network locking constant*: 3 ^ p < /„, . The 
general response function (7.2) with X r = *JI\/I m thus involves the two physical 
constants /x 0 and /,„ . 

The constitutive equation (6.7) associated with (7.2) is valid for all deforma- 
tions of rubberlike materials; it is an equation for which general solutions of many 
boundary-value problems are well known, and for which specific knowledge of 
the response function itself is not essential. Therefore, our average-stretch, full- 
network model for homogeneous non-Gaussian networks of randomly oriented 
molecular chains is especially useful in the general formulation and solution of 
a great variety of practical problems in finite elasticity. We find that the simplest 
of these average-stretch, non-Gaussian full-network models is described by the 
Arruda-Boyce constitutive equation (6.7) with shear response function (6.8). 

* In principle, specifically, the network locking constant may be determined (actually only esti- 
mated) by a simple uniaxial experiment with limiting uniaxial stretch Am at which network chain 
locking will occur. We thus have 

I m = 3 N — k[j m + - . (7.4) 

Aim 

It follows, as described in [13], that the limiting value Aim of the greater principal stretch of the 
continuum in any other deformation is given by the equation I\ = I m . The limiting stretch (tension 
or compression) in an equibiaxial stretch, for example, is given by the two positive roots of the 
equation 2k^ m — 7 m kjj m +1=0. Clearly, the limiting stretch A; m of the continuum in an affine 
deformation is not equal to the molecular chain locking stretch 
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8. Conclusion 

We have shown that among all uniform polyhedra cell structures, the 4- and 8-chain 
models share the unique property that all of their chains in a principal reference 
system have the same stretch. Consequently, they arc described by the same con- 
stitutive equation first derived by Arruda and Boyce [6], These arc special models 
within the class of uniform non-Gaussian network models, a general model for 
which is provided by the Wu and van der Giessen [9] constitutive equation which 
is essentially based on the Treloar-Riding [8] general energy integral for a homo- 
geneous full-network of non-Gaussian chains. All arc based on the Kuhn-Grim 
distribution function [2, 3]. We have shown that the constitutive equations for 
the classical neo-Hookean, the 3-chain, and the 8-chain (hence also the 4-chain) 
models may be derived from the Wu and van der Giessen full-network model. It 
is also shown that the volume averaged, squared stretch of an arbitrarily directed 
chain is the same squared stretch that characterizes the isomorphic 4- and 8-chain 
cell models; the difference, however, is that no specific chain cell morphology is re- 
quired. With this result in hand, an average-stretch constitutive equation is obtained 
by approximation from the Wu and van der Giessen equation for uniform networks. 
We find that this reduced equation is precisely the same as the Arruda-Boyce [6] 
constitutive equation for the 8-chain model; and a si mi lar equation with a different 
shear response function may be derived for other models based on amended forms 
of the Kuhn-Grim function. The same average-stretch, full-network approximation 
is applied directly to obtain from the Kuhn-Grim function an approximate total 
strain energy function for a uniform full-network model that leads again to the 
Arruda-Boyce constitutive equation. 

It is known that experimental data on homogeneous deformations of a variety 
of rubberlike materials stand in very good overall agreement with results based on 
the Arruda-Boyce constitutive equation [6, 9], the greatest variance of the model in 
comparison with all known data arises in equibiaxial deformation. All data reported 
so far, however, focus on homogeneous deformations for which principal axes arc 
readily identified. It would be most useful and interesting to expand the comparison 
of additional experimental data on the same materials with theoretical predictions 
based on the Arruda-Boyce constitutive equation, represented in (6.7) and (6.8), for 
the torsion of bars, the twisting and inflation of tubes, and the bending of rubber 
rods, for example. 

9, Endnote: Remarks on Some Related Unpublished Work 

A reviewer has pointed out that some parallel results derivable from (5.2) have 
been reported in unpublished work by Puso [21]. Puso remarks that the strain 
energy functions for both the James-Guth and Arruda-Boyce models may be ob- 
tained from the Wu and van der Giessen (actually the Treloar-Riding) equation 
(5.2) as Gauss point approximations with six and eight points, respectively. In fact, 
no approximations appeal - necessary. It is customary to assume a priori for these 
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models that the chains arc equally distributed along the three principal axes or along 
the four diagonal lines of a cube with edges along principal axes; and, of course, 
oppositely directed chains have the same stretch and arc assumed equivalent. Thus, 
for the 3-chain case, replacing w(X r ) with each directed chain density contribution 
Wk = w( Xk) in summing over the unit sphere in (5.2) (opposite chains having the 
same stretch) and noting that (1/(4 tt)) f* sin (9 (l d(9 () dr/; 0 = 1, we see that the 
total strain energy (5.2) reduces to (2.6) (with p = 3) which thus leads to (3.2) for 
the James-Guth model. Similarly, for the 8-chain model, replacing w(X r ) with each 
Wk = w(X r ) and X r = *fT\ f3N* the total energy (5.2) simplifies to (2.7) which thus 
yields (4.7) for the Arruda-Boyce model. There is no mention of identical results 
for the Flory-Rehner tetrahedral model [5] whose four distinct chains arc similarly 
oriented along four diagonal lines of a cube, as shown in Figure 1(b), all of which 
have the same relative chain stretch X r = ^/T\J3N, as shown earlier. Plainly, the 
corresponding Cauchy stress for each model must then be obtained separately by 
application of (2.9), though Puso prefers the second Piola-Kirchhoff stress rep- 
resentation. I think this is essentially the idea perceived by Puso for derivation 
of the aforementioned constitutive equations from the Treloar-Riding total energy 
relation (5.2). 

A leading objective in [21] is to obtain a certain series approximation of (5.2) to 
provide a simplified constitutive equation in terms of stretch invariants that demon- 
strates improved accuracy over either of the special chain models and which can 
be used in finite element formulations of boundary value problems. The model 
thereby avoids integration of the Langevin function following use of (2.5) in (5.2). 
First, the inverse Langevin function in our current notation is approximated by 
ft = 3 /,, /( I — /ri), an empirical estimate that exhibits very good graphical compar- 
ison with ft = X~ l (X r ) up to X r = 0.8, after which it increases faster than the exact 
value. This estimate is introduced in the force relation given in our footnote **, 
page 3, which is then integrated with respect to X r to obtain the strain energy w(X r ), 
per chain, as a sum of certain logarithm and inverse tangent functions* . The result is 
then used in (5.2), still not integrable. The next step in the constitutive formulation 
for a chain with stretch A chain in an arbitrary direction consists of a Taylor series 
expansion of uri /-chain) = w(X r ) about the chain stretch X cham = y/Ii/3, or the 
relative chain stretch X r = I \ /3/V of the 8-chain model, an interesting idea. In 
essence, though not stated in [21], it is assumed that the stretch in an arbitrary 
direction varies only slightly from its mean value in (6.4). As may be expected, the 
zero order term necessarily, by the assumed construction, leads to the constitutive 



* I find, however, that Puso’s integral result for w(X r ), which is to be used in the subsequent 

Taylor series expansion of w(X r ), contains an error repeated in subsequent equations related to it. 
I have not confirmed all of the subsequent details. The consistent appearance of this error and others, 
including incorrect equations for /A ■ and for the second Piola-Kirchhoff stress tensor everywhere, 
while troublesome, are probably typographical slips. I note, for example, that Puso reports the correct 
value (6.4) obtained from the integral (6.3) that appears in the first order Taylor series term of the 
total energy function. 
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equation for the 8-chain model. By use of Puso’s series estimate, the previously 
non-integrable constitutive relation (5.2) in terms of the inverse Langevin func- 
tion is much simplified to an algebraic constitutive equation having the general 
tensorial representation T = FSF T = —pi + Ki(/i, 7 2 )B + hs 2 ( /i )B’, based on 
the form of S given in [21], the accuracy of which I have not confirmed. Like all 
other non-Gaussian and related phenomenological models, this one fails to capture 
effects observed in equibiaxial extension tests, though it does better than others 
in comparison with uniaxial data. Puso’s series model subsequently is modified to 
include effects due to neighboring chain entanglement interactions. The adjusted 
model shows very good comparison with equibiaxial data for natural gum rubber 
by James et al. [22], and improved though still imperfect and stiffer response in 
comparison with similar - equibiaxial data by Treloar [23]. 

Although my construction exhibits some similarities of result, it differs funda- 
mentally in its procedure, objectives, and simplicity. All of the familiar - constitutive 
equations for the Cauchy stress, including the classical neo-Hookean equation, are 
readily deduced directly from the Wu and van der Giessen representation (5.7). 
My direct average-stretch approximation of the Wu and van der Giessen equa- 
tion (5.7) shows that the Arruda-Boyce constitutive equation is equivalent to an 
average-stretch, full-network model. Moreover, it is shown that similar - constitutive 
equations of the type (6.7) with different shear - response functions hold for other 
models based on amended forms of the Kuhn-Grim function. None of these results 
have been reported elsewhere. 

Finally, let us recall that the Cauchy stress for a general incompressible, isotropic 
hyperelastic material may be written as 



^ , JW„ 3W , 

T = -pi + 2 B - 2 B 1 , 

9/ 1 dh 



(9.1) 



in which the strain energy function W = W{I i, A). Now suppose that the relative 
chain stretch in an arbitrary direction of a full-network of randomly oriented, per- 
fectly flexible molecular - chains does not stray very far - from its mean value (6.5). 
We may then obtain from the Treloar-Riding energy functional (5.2) the following 
approximate total strain energy function 



W(I i) = nwiXr) 



(9.2) 



in which w(X r ) is the approximate strain energy per chain. For all such mod- 
els, W = W(I\), a function of I\ alone; and (9.1) then has the reduced general 
form (6.7) with shear - response function (7.2). This is precisely the result obtained 
differently in (6.9) based on the mean Kuhn-Grim configurational entropy per 
chain. Indeed, when w(X r ) is given by the Kuhn-Grim chain energy function (2.5), 
it readily follows that dw(A,.)/d7.,. = Nk@/3(X r ) m , and in this case, it is easily seen 
from (9.2) that 



dw dw(i r )dx 2 r 

2 = 2 n ^ 

3/i 9 ( 7 . 2 ) dh 



M o /9 (7./) 



= /x(/,). 



37. 



(9.3) 
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Therefore, as mentioned before, (9.1) reduces precisely to the average-stretch, full- 
network result (6.7), i.e. the Arruda-Boyce constitutive equation for the isomorphic 
4- and 8-chain models. Of course, the result (9.3) is more general in that no specific 
chain cell morphology is assumed. 

The estimate in (9.2), and hence my result (6.9), essentially corresponds to the 
lowest order approximation in Puso’s series expansion of w ( ’/,, ) about the squared 
relative chain stretch of the 8-chain model. The introduction in (9.3) of Puso’s 
aforementioned empirical estimate now yields the following approximate shear 
response function for an average-stretch, full-network constitutive model: 

A(/t) = • (9-4) 

1 — Xl 



We recall that X r e [0, 1], and hence the finite chain extensibility effect is evident 
in (9.4). Indeed, recalling the network locking constant (7.3) in (6.5), we see that 
the reduced shear response function (9.4), based on Puso’s estimate, is now given 
explicitly in terms of the physical constants fi 0 and /„, : 



/}(/,) = 



A o 

l-(/i//,„) 3/2 ' 



(9.5) 



Clearly, the first principal invariant f in every affine deformation of this material 
is bounded by the network locking constant: /i e [3, The general constitutive 
equation (9.1) for the average-stretch model thus simplifies to 



T = 



- p\ + /U, 0 





(9.6) 



Though here obtained differently and expressed in altogether different terms, this 
simple result is Puso’s first order constitutive equation [21, equation (1.2.27)]. 
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Abstract. This paper represents a first attempt to derive one-dimensional models with non-convex 
strain energy starting from “genuine” three-dimensional, nonlinear, compressible, elasticity theory. 
Following the usual method of obtaining beam theories, we show here for a constrained kinematics 
appropriate for long cylinders governed by a polyconvex, objective, stored energy function, that the 
bar model originally proposed by Ericksen [3] is obtainable but enriched by an additional term in 
the strain gradient. This term, characteristic of nonsimple grade-2 materials, penalizes interfacial 
energies and makes single-interface two-phase solutions preferred. The resulting model has been 
proposed by a number of authors to describe the phenomenon of necking and cold drawing in 
polymeric fibers and, here, we discuss its suitability to interpret also the elastic -plastic behavior 
of metallic tensile bars under monotone loading. 
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1. Introduction 

Different-in-type material responses, such as ductile [1] or brittle [2], can be in- 
terpreted with a variational approach. In one-dimension, simple models with non- 
convex stored energy, of which Ericksen’s [3] is perhaps the most cited example, 
represent an extension to solids of the Van der Waals idea, which applies to a sur- 
prisingly wide range of materials. Theories of this kind, as more deeply considered 
by Dunn and Fosdick [4], allow for stress- and deformation-induced phase tran- 
sitions and they predict discontinuous strain fields in reasonable agreement with 
experimental observations. 

The basic difficulty in 1-D models of the kind given in [3, 4] is that phase- 
rearrangements are equienergetic and, consequently, minimizing strain fields arc 
in general highly non-unique. In elastic fluids, where a similar problem occurs 
when the energy is assumed to be a nonconvex function of the density, several 
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important results concerning the interfacial energy between phases were obtained 
by Cahn and Hilliard [5] by introducing a further dependence of the energy upon 
the density gradient. A number of authors have attempted to solve the congenital 
indeterminacy for solids by similarly changing the energy functional. For example, 
Can - et al. [6, 7] proposed a grade-2 model for one-dimensional bars, in which 
the energy is nonconvex in the strain and quadratic in the strain gradient. Their 
proposal is a particular' case of a more general theory earlier advanced by Coleman 
for thin polymeric fibers [8]. Such models represent the natural extension of the 
Cahn and Hilliai'd idea and, in substance, predict that configurations with least en- 
ergy are the two-phase single-interface solutions. Consequently, once the average 
elongation of the bar is known, the minimizing strain field is uniquely determined 
modulo reversal. 

The aim of this paper is to discuss the soundness of models of this kind apart 
from any agreement with experimental evidence and to exhibit their consistence 
with 3-D nonlinear elasticity theory (respecting in particular classical requirements, 
such as objectivity). The method is si mi lar to the traditional approach to beam the- 
ory using a three-dimensional parent theory, i.e., one that assumes a restricted kine- 
matics such as is common to the Bernoulli-Navier or Timoshenko hypotheses [9]. 
Flere, it is shown that for particular choices of polyconvex objective strain energies, 
the corresponding minimization problem, in a restricted class of kinematically- 
constrained deformation fields, naturally leads to the aforementioned 1-D strain- 
gradient models. 

This idea has already been pursued by Coleman and Newman, who showed 
that the 1-D model in [8] for polymeric fibers can be obtained starting from an ad 
hoc incompressible three dimensional elasticity parent theory [10, 11]. The main 
difference between their work and ours consists in the choice of the elastic potential 
for the 3-D parent theory. Here, we do not consider an incompressible material and, 
indeed, it is the material reaction against volume changes that yields the nonconvex 
(though polyconvex) character of the elastic potential; this is maintained in the one- 
dimensional reduction. Whether the nonconvexity of the 1-D reduced model is a 
strict consequence of the assumed kinematical constraints, or is a general property 
independent of any simplification, remains an open question at this stage, not even 
discussed in [10, 11]. We conjecture that this conclusion holds true even under 
weaker kinematical restrictions and for more general elastic potentials, but the 
consequent analytical complications call for a numerical approach, which will be 
considered in a further work. 

In any case, despite drastic simplifications, the present analysis shows that bar 
models with nonconvex energy may be naturally deduced from the most classi- 
cal theories for three-dimensional compressible elastic bodies. The resulting one- 
dimensional stored energy function, nonconvex in the axial strain and quadratic 
in the strain gradient, coincides with the earlier proposals of Carr et al. [6, 7] and 
Coleman and Newman [8, 10-12]. The second-order term, characteristic of non- 
simple grade-2 materials, penalizes interfaces between heterogeneous phases since 
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they must be separated by a transition zone. This is characteristic of the well-known 
phenomenon of necking in polymers and their cold drawing for increasing average 
elongation. Moreover, we recognize that also the behavior of other materials, in 
particular the yielding of metallic bars, can be interpreted using the same model; 
this bears on the original idea of Muller and Villaggio [1], 

2. The variational problem under kinematical constraints 

Let us consider a straight bar with constant cross section and length L. A reference 
orthogonal system (x, y, z), with associated unit vectors i, j, k, is introduced so 
that the / -axis is parallel to the bar longitudinal axis and passes through the centroid 
of its cross section. Let £ = £2 x (0, L ) c R 3 denote the undistorted natural 
configuration of the body, where S2 c R 2 is the domain representative of the cross 
section. A deformation is defined through the mapping y(x): 43 —* R 3 which 
satisfies the usual hypothesis, i.e., regularity, injectivity, det Vy > 0. 

The bar is made of a simple hyperelastic material defined through the strain 
potential VT(Vy): Lin + — > R, supposed to comply with the material objectivity 
requirements and with the well-known polyconvexity condition of Ball [13]. Thus, 
W takes the form 

W (Vy) = g(Vy, adj Vy, det Vy), (2.1) 

with g(-, ■, •) convex in each term. Such requirement is crucial to demonstrate 
existence theorems in nonlinear elasticity theory [13], since it implies the lower- 
semicontinuity of the energy functional. Indeed, lower-semicontinuity is assured by 
the less-restrictive quasi-convexity condition of Morrey. But this weaker hypothesis 
guarantees existence in general only if growth conditions for W arc assumed [13] 
that prohibit any singular behavior, such as the physical requirement 

detVy— >0+ =T W(Vy) -»• +oo. (2.2) 

The particular class of stored energy functions W here considered, usually referred 
to as the Blatz-Ko potential, is given by 

W (V y) = a(|Vy| 2 Tp(detVy)), (2.3) 

where a > 0 is a constant and ip is a smooth convex function such that <^(dct Vy) — » 
Too as det Vy — »■ 0 + or det Vy — > Too. Clearly, W of (2.3) is polyconvex 
and (2.2) is satisfied. It should be noticed, however, that W is nonconvex in Vy. 
Indeed, any hypothesis of convexity would imply serious physical inconsisten- 
cies, such as either failure of material objectivity and of (2.2) or unacceptable 
monotonicity of forces [14]. 

For this class of stored energy functions, the Piola-Kirchhoff stress, defined by 
S = Wy y, has the form 

S = «(2Vy T <p'(det Vy)Vy _T ). 



(2.4) 
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The assumed condition that the undistorted reference configuration is natural, i.e., 
S|v y =i = 0, implies the further requirement 

<p'i 1) = -2- (2.5) 

Fosdick and Roycr-Carfagni [15] have recently considered potentials of this kind 
to exhibit possible material instabilities. 

Let us consider a particular kinematics for S, defined through a class of defor- 
mations where a particle at x = xi + yj + zk is mapped to 

y = ((1 + v)x - vxd z f(x, y, z))i + ((1 + v)y - vyd z f{x , y, z))j 

+ fix, y, z)k, (2.6) 

where /: 3 -» R is a function to be determined. Notice that when the bar is uni- 
formly stretched longitudinally so that f z — const = 1 + e, it contracts laterally 
in the ratio (1 — ve) : 1, so that v may be referred to as the coefficient of lateral 
contraction. 

When the bar is extended in a hard-loading device, from (2.3) and (2.6) its 
energy takes the form 

E [/] = J a{2(l + v - vf z ) 2 - 2v(l + v - vf z )d z (xf x + yf y ) 

+ fl + f 2 y + fl + "V + y 2 )if 2 zx + f 2 zy + f 2 zz ) 

+ (p{fzi l + V - V fz ) 2 + V (1 + V - vf z ) 

X Vfzzixf. + yfy) - fzUxfx + yfy)T)} dV. (2.7) 

The associated vaiiational problem is the following: 

minE [/], (2.8) 

feA 

where the class A of admissible functions / is characterized by conditions 

fix, y, 0) = 0, f (x, y, L) = PL. (2.9) 

The parameter p defines the average stretch. 



3. The Particular Case of Null Lateral Contraction 

In order to analyze the vaiiational problem (2.7)-(2.9), it is helpful to consider first 
a preliminary case when the coefficient of lateral contraction v is equal to zero. 
Under this condition, the energy (2.7) reduces to 

E o[/] = f a[ 2 + f 2 x + f + f) + <p(f z )] dV. (3.1) 

Js 

For boundary conditions of the type (2.9), it can be easily seen that any minimizing 
field /* must enjoy the property f* = f* = 0 , i.e., there is no warping of the bar 
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cross sections. Moreover f*(x, y, •) = w*(-) V(x, y), where w *: (0, L) cMaM 
is a solution of the auxiliary one-dimensional problem 

min E 0 [u>|, A — {u; | w( 0) = 0, w(L ) = /3L], (3.2) 

weA 



with 

E 0 [w] = [ \j/{w\z))dz, 

Jo 

x/s(w'(z)) = aA[ 2 + (w'(z)) 2 + <p(u/(z))]. 



(3.3) 



Here, A is the bar cross-sectional area. 

In fact, for any f(x,y,z): S C R 3 a K, let Wf(z ): (0 , L) c M — »■ M 
represent its restriction to the z-axis. Clearly E 0 1 w / 1 ^ E 0 [/], where the equal 
sign holds iff f x = / v = 0. On the other hand, if w* solves (3.2), then E 0 [u;*] ^ 
E 0 | w f\. Let us then define /*(x, y, z) as the extension of w*(z) such that /*(x, y, ■) 
= !/;*(•) V(x, y), that is f* = f* = 0. We can therefore write 

EoLTl = Eolw*] ^ Eo[ W/ ] ^ E 0 [/], V/ e A. (3.4) 



In words, the terms f x and f y in (3.1) produce the coupling in the response of 
fibers parallel to the z axis, which must deform to the same extent at any cross 
section in order to minimize the energy. 

This finding is important if the function in (3.3) is not convex. In this 
situation, it has been clear since Ericksen’s analysis [3] that, for in a given range, 
two-phase solutions are highly non-unique, since phase-rearrangements arc ener- 
getically equivalent. If the longitudinal fibers of the bar behaved independently of 
one another, phase rearrangements would be completely arbitrary for each one of 
them. On the other hand, the effect of the coupling terms f x and f y in (3.1) implies 
the organization of phases in layers at right angle to the bar axis. In conclusion, 
the phase-rearrangement indeterminacy still persists in the case v = 0, but only 
longitudinally and not transversally. 

However, a substantial remark should be made at this point. If cp(-) is convex, 
as is assumed in Section 2 to assure polyconvexity of the stored energy, from (3.2) 
also i /f(-) will be convex. Consequently, it is well known that for any ( J > the unique 
solution of (3.2) is the trivial one, where the longitudinal strain is uniform in the 
whole bar. Evidently, the assumption v = 0 is too drastic and rules out a wide range 
of interesting cases. In other words, necking due to lateral contraction should play 
a crucial role. 



4 . The Bernoulli-Navier Case 

Let us now consider for (2.7)-(2.9) a further kinematical hypothesis, si mi lar to 
that of Bernoulli-Navier for beams, i.e., the deformation preserves the flatness of 
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transverse planes normal to the cylinder axis. This is equivalent to assuming that 
f(x, y, z ) in (2.6) does not depend upon x and y but, for convenience, we do 
not change notation and simply denote with / = f(z) the new corresponding 
function and with f'(z) its derivative. A restricted kinematics of this kind was 
earlier considered by Coleman and Newman [10, 11] in their derivation of the one- 
dimensional theory [8] as an approximation for the response of a three-dimensional 
long fiber made of polymeric incompressible elastic material. 

The total energy for & becomes 

E [/] = f a[ 2(1 + v - vf) 2 + f 2 + v 2 /" 2 (x 2 + y 2 ) 

JB 

+ <p(f(l + v-vf) 2 )]dV, (4.1) 

which can be integrated in the cross section, to give 

E [/] = a A £ (A (/') + ^Kf'^dz. (4.2) 

Here, the function A: M M, from (4.1), takes the form 

A (?) = 2(1 + v — vt) 2 + t 2 + <p{t{ 1 + v — vt) 2 ), (4.3) 



while the constant k is equal to 




Io, 



(4.4) 



having denoted with 7 0 the second order polar moment of inertia of the bar cross- 
sectional area. 

What should be noted in (4.3) is that, despite the fact that ip is assumed to 
be convex, the cubic c(t ) = t( 1 + v — vt) 2 is not. Consequently, A (t) may be 
nonconvex. To illustrate this aspect, let consider the following example. Let ip be 
the convex function defined by 



(Pic) = 2 



c 10 - (19 + 10 _8 )c + 10 -12 c -10 



9 + 10- 11 + 10 



c > 0. 



(4.5) 



For v = 0.1, c(t) — t( 1 + v — vt) 2 is a nonconvex cubic and, the composed 
function <p(c(t)) is also nonconvex. The graphs coixesponding to c(t), ip(c) and 
ip(c(t)) arc juxtaposed in Figure 1. 

As a result, the graph of A (t), appealing in (4.2) and defined in (4.3), is of the 
type represented in Figure 2. Notice that condition (2.5) assures that A (t) has a 
minimum at t = 1 , so that the undistorted configuration is natural. 

In conclusion, it has been shown that classical nonlinear elasticity theory may 
directly offer, under appropriate kinematical assumptions, 1-D models where the 
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stored energy is a nonconvex function of the axial strain. However, the exam- 
ple at hand contains the natural appearance of a second-order term in the result- 
ing energy (4.2), which plays the important role of penalizing interfaces between 
heterogeneous material phases. 



5. The Resulting One-Dimensional Model 

Setting u := /', we find that the problem under consideration becomes the follow- 
ing: 



PROBLEM D . Minimize 

r L r 1 

n[«] =aA A (u(z)) + -k(u'(z )) 2 dz, 

Jo L 2 

over all u e W l ’ 2 (0, L ), u > 0, consistent with the constraint 




uiz) d z = fiL. 



(5.1) 



(5.2) 



This model is well studied in the literature. It was proposed by Coleman [8] and 
Carr et al. [6, 7], and later developed by Coleman and Newman [10-12], as a gen- 
eralization of the original idea by Ericksen [3]. Here, for the sake of completeness, 
we briefly recall the main results. 

Under mild smoothness hypotheses, direct methods in the calculus of varia- 
tions [16] show existence of at least a minimizer, and that any such minimi zer 
solves the corresponding Euler-Lagrange equations. 

Now, let oq and a 2 be defined by the Maxwell conditions 

A(a 2 ) — A(<*i) = cr 0 (a 2 — aO with cr 0 = A'(ai) = A'(a 2 ), (5.3) 

i.e., the line Atop) + a 0 (t — op ) supports the graph of A (t) from below. It is easy 
to show that when fi ^ op or f ^ a 2 the solution of -TVo is the trivial one u*(z ) = 
fz, In fact, such u*(z ) minimizes n [ w ] of (5.1) for k = 0 [3] and, consequently, 

A(w*(z))dz ^ aA f A(u(z))dz 

Jo 

r L r 1 

^ aA A (u(z)) + -k{u\z)) 2 dz = n[«], Vw. (5.4) 

Jo L 2 

The most interesting case is when a\ < fi < a 2 , whose characterization is given 
by the following theorem, contained in [6, 7]. 

THEOREM (Carr, Gurtin and Slemrod). Assume that A is a smooth noncon- 
vex function of the form shown in Figure 2 and, more precisely, A is of class 
C 5 ( 0, L), A" > 0 on (0, a\ ) U (a 2 , oo) and A" < 0 on ( a \ , a 2 ), A'(0 + ) = — oo 
and A'(+oo) = +oo. Then, for any f> e (ai, a 2 ): 



n[M*] = 



= a A 

Jo 
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(i) when k > 0 is small enough, problem ■ < P\-o has a unique (modulo reversal) 
solution u*(z); 

(ii) u* (z) is strictly monotone; 

(iii) As k —* 0, u*(z ) approaches the single interface solution u*(z), defined as 
( modulo its reversal) 



| a i 0 ^ z ^ 

1 a 2 l < z ^ L, 



(5.5) 



with l = L(a 2 ~ f)/( a 2 — of i). 



In other words, for infinitesimal k the higher order term causes the two-phase 
solutions with least energy to become the single-interface solutions. For the dis- 
cussion of this very important case, we report an elegant argument by Alberti [17], 
providing an elementary characterization of the transition zone between hetero- 
geneous phases. To this aim, it is convenient to consider a problem equivalent to 
P\-d, i-e., 

mini A~(u(z)) + }-k(u' (z)) 2 dz, [ u(z)dz = /3L, (5.6) 

Jo 2 J o 



where 



A (u(z)) = A (u(z)) - [A(aq) + cr 0 (u(z) - an)], (5.7) 

and ct o has been defined in (5.3). With this choice, A~(-) is non-negative and 
presents only two absolute minima at a i and a 2 , with A=(«i ) = A=(a 2 ) = 0. 
It is well known that minimi zers of (5.6) coincide with solutions of tP\-n since any 
lineal - functional is a null-Lagrangian for n [m] in (5.1). 

Now, case (iii) of the aforementioned theorem suggests that when f> e (cy i . a 2 ), 
the material tends to be separated into two different phases, but their interface is 
not sharp because of the gradient term in (5.7). The two phases are connected by 
a transition zone, which produces an effect in the bar equivalent to a distortion. 
Under De Saint Venant’s principle, the effects of such a distortion should be negli- 
gible at distances that are large when compared with the diameter of the bar cross 
section. Consequently, if the length of the bar is much larger than the diameter of 
its cross section and f is far from c/\ and a 2 , recalling (5.5) the transition zone is 
certainly distant from the bar extremities. Consequently, the following optimal pro- 
file problem may be considered for the determination of the shape of the transition 
zone. 



PROBLEM -Pop (Optimal Profile). Find u: R — 1R such that 

r +° ° r 1 1 

aA A = (w(z)) + -k(u'(z)) 2 dz = min, 

J — OO L ^ 

lim u(z) — on and lim u(z) — a 2 . 

Z-*~oo Z-S.+00 



(5.8) 
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Following the argument proposed by Alberti [17] for the T -convergence of 
Cahn-Hilliard models, the inequality r 2 + s 2 ^ 2r ■ s with r = (^k)^ 2 li and 
5 := V A=(m(z)) is applied to (5.8)i to give 



ciA 



L 



+oo r 



A ~(u(z)) + \-k(u'(z )) 2 dz 



^ 2aA 
^ 2aA 



/ +°° n 

\/ A=(u(z)) ■ y -K u\z ) dz 

r 2 fi I 

J ^2 l<A= ^ du = y ' 



(5.9) 



The quantity 

r 2 /i I 

y = 2r/A / J -kA=(u) du, (5.10) 

J a\ y ^ 

represents the energy necessary to produce an interface between the two phases 
and depends only upon the shape of A= and, consequently, of A. Noticing that 
r 2 + s 2 = 2r ■ s if and only if r = s, we find from (5.9) and (5.10) that the lower 
bound y for the integral in (5.8)i is attained provided u satisfies the differential 
equation 




\Z) = y/A -( H ( Z )), 



(5.11) 



with boundary conditions (5.8)2. Notice that u is monotone and u' — > 0 as it 
approaches the value a\ or a 2 . 

To illustrate the characteristics of the transition zone in one example, suppose 
that A takes the form 



A (u) = bi{u — ofi) + b 2 + — -/c inf {( u — a i) 2 , (, u — a 2 ) 2 } 



(5.12) 



for some constants b\ , b 2 and f . Rigorously speaking, expression (5.12) does not 
follow immediately from (4.3), but since what is important is the nonconvex char- 
acter of A in a neighborhood of (ai, a 2 ), it is assumed that there exist particular 
choices of <p(-) and v such that (4.3) is well approximated by (5.12) in such a 
neighborhood. 

From (5.3) and (5.7), A~ becomes 



A~(u) = —k inf{(M -o' 1 ) 2 , (u-a 2 ) 2 }, 

and the differential equation (5.11) has the form 

\u — a\\ a i+q -2 

tor a i A u < , 



u — 



K 

I u - a 2 \ 

7 



«! + a 2 

for ^ u A a 2 . 



(5.13) 



(5.14) 



2 




NONLINEAR ELASTICITY AND 1-D BARS WITH NONCONVEX ENERGY 



97 



By symmetry, the boundary conditions (5.8)2 may be replaced by requiring 
«( 0) = (aq +q! 2 )/ 2 , so that a simple integration gives the following solution u*(z): 



u*(z) = 



^(a 2 -oq)exp^^ +oq forz<0, 
^(<*1 — a 2 )exp^— +a 2 for z > 0. 



(5.15) 



It should be noted that f represents a characteristic length for the transition 
zone. Moreover, from (5.10) the energy y necessary to produce the transition 
(interface energy) is 



K 

y = aA — 




(5.16) 



which depends also upon k. 

Recalling (2.6) and the notation m = /', we see that the bar undergoes a lateral 
contraction equal to v(u — 1). Consequently, for ft e (oq,^), the shape of the 
deformed bar would appeal - as represented in Figure 3. The lateral contraction of 
the bar is v(oq — 1) or v(a 2 — 1) for the portions on the left-hand side or on the 
right-hand side of the transition zone respectively. Provided that £ <& L, the length 
of the phase 1 -portion is approximately 



/ = 



a --?L 

C/,2 ~ Oil 



(5.17) 



The quasi-static equilibrium states of the bar, as the average stretch [ J > is gradu- 
ally increased starting from the undistorted state /i = I , arc as follows. 

(i) For 1 < p ^ oq, the bar is uniformly stretched. The axial dilatation is (ft — 1) 
and the lateral contraction is v — 1). 

(ii) For oq < ( J > < oq. the deformation of the bar is as in Figure 3. As /3 increases, 
one phase evolves at the expense of the second phase, gradually invading the 
whole bar. 

(iii) For ft ^ cy 2 , the bar dilatation becomes again uniform and equal to (/l — 1) as 
in case (i). 



Necking Undistorted configuration 




Figure 3. Predicted necking in the transition zone between heterogeneous phases. 
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In practice, the energy consumed in producing the necking is interpreted as a 
surface energy between heterogeneous phases, assuring uniqueness (modulo rever- 
sal) of minimizers also in stage (ii). As ft is increased the neck moves along the bar, 
changing the ratio between the length of the regions occupied by heterogeneous 
phases in order to accommodate the average elongation f J >. This particular - motion 
is usually referred to as drawing [8]. 



6. Comparison with Experiments and Conclusions 

The model defined by (4.2) is a particular case of a general theory appositely con- 
ceived by Coleman [8] to describe the cold drawing in polymeric fibers such as 
nylon. In the undistorted condition, nylon is composed of long chainlike molecules 
that are oriented at random. Under a tensile force increasing from zero such fila- 
ments, tangled together, do not stretch much at first, but when a certain critical load 
is reached, the long molecule chains suddenly rearrange themselves, so that they 
become parallel to the filament axis. At this instant, a sharp constriction forms in 
the filament profile, in general nucleated at one of the extremities where the fiber 
is clamped by the loading device. A moment later, the contracted portion is seen 
to increase its length and the “wave front” moves along the specimen towards the 
opposite end (drawing), due to the successive rearrangement of the molecules in 
the neighboring portions. Eventually, the filament shape looks as represented in 
Figure 4, where the correspondence with the behavior of Figure 3, predicted by the 
model, is evident. After the entire filament has contracted, all the molecules have 
been rearranged and, if the fiber is further pulled, the longitudinal strain (and the 
consequent lateral contraction) increases uniformly throughout the bar [18]. 

It is perhaps less known that such a behavior, despite being caused by a mecha- 
nism completely different at the microstructural level (dislocation movements), is 
also analogous to the manner in which a bar of mild steel elongates. Metals have 
the characteristic property of possessing a well-defined “yield point”, at which they 
start to stretch permanently. At the macroscopic level, the load vs. displacement 
curve is characterized by a pseudo-horizontal plastic plateau, but careful exper- 
iments [19] reveal that plastic strain does not progress uniformly throughout the 
bar. When one element or layer yields, it strains a few percent almost instantly and 
then nearly stops, while yielding is transferred to the neighboring portions. 

In both examples, a microstructural rearrangement (molecular alignment or 
plastic slip) produces a strain jump with the same characteristics as a phase tran- 




Figure 4. Constriction forming on thin filaments of nylon after pulling (from [18, p. 300]). 
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sition, which occurs in 1-D elastic bars with nonconvex stored energy. Indeed, 
Muller and Villaggio [1] recognized that Ericksen’s model [3] was suitable also 
for an elastic -plastic body. However, if the segments of the bar could yield inde- 
pendently of one another, like the rings of a chain, the process would be highly 
chaotic, in agreement with the non-uniqueness inherent in Ericksen’s model. On 
the other hand, experiments on metals show an orderly deformation si mi lar to that 
of polymers. A possible explanation stems from the observation that the yield point 
of metals is greatly influenced by any stress concentration. There is a wealth of 
experimental evidence [19] that localized yielding produces a condition equivalent 
to a stress concentration at the boundary of the yielded portion, which can greatly 
influence [20] the behavior of the neighboring parts (nonlocal effect). The yielding 
of the first portion produces a chain reaction that produces the spreading of plastic 
deformation through the specimen si mi lar to the cold drawing of polymers. For the 
model of Section 5, this nonlocal effect is due to the constriction shown in Fig- 
ure 3 that, while connecting heterogeneous phases, produces a condition somehow 
equivalent to a stress concentration. The energy consumed by the formation of such 
a transition plays the role of an interface energy. 

In the class of models discussed here, the nonlocal effect is interpreted by the 
presence of the quadratic term in the strain-gradient which appeal's in the energy 
functional (4.2). A number of authors have proposed to add strain-gradient de- 
pendence in order to regularize Ericksen’s problem and to solve the congenital 
equivalence of phase rearrangements. In this paper we have emphasized that, for 
a specified restricted kinematics, classical 3-D nonlinear elasticity theory naturally 
yields a 1-D formulation with the aforementioned characteristics. In particular', in 
our derivation the nonconvex strain dependence of the energy, that dependence 
which allows for phase transformations, is inevitably associated with the appear- 
ance of a convex term in the strain gradient, thus providing the nonlocal effect 
necessary to reproduce the phenomena of necking and drawing. 
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Abstract. The balance equations of mass, momentum, energy and entropy at a phase boundary 
imply phase boundary conditions which determine the position of the boundary as a function of 
temperature. This is true when either the phase boundary is sharp or when it occurs through a 
transition zone, albeit the latter case seems to require strongly symmetric geometry. 
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1. Eshelby Tensor as a Tensor of Free Enthalpy 

Viewed macroscopically phase boundaries may sometimes be sharp boundaries 
between uniform phases. Such is the case in a liquid-vapour or a solid-liquid 
phase boundary; some solid-solid phase boundaries are of that type too, notably for 
martensitic transformations. In such cases the equations of balance of thermody- 
namics assume the form of jump conditions on a singular surface, the phase bound- 
ary. In addition to the balance equations coherency of the phases requires kinematic 
compatibility conditions which relate the jumps of the velocity components across 
the phase boundary to the mass rate of the phase transition. 

Thus, one obtains an entropy inequality on the phase boundary which implies 
that the mass rate of the phase transition is proportional to the discontinuity of 
the normal component of the Eshelby tensor. This idea goes back to the procedure 
of lineal - irreversible thermodynamics (e.g., see [1]), by which the thermodynamic 
forces and fluxes, whose product forms the entropy inequality, are proportional. 
This idea has been adapted in [2, 3] to the motion of a phase boundary. In par- 
ticular, in equilibrium the normal component of the Eshelby tensor is therefore 
continuous. This equilibrium condition generalizes the thermodynamic condition 
of phase equilibrium by which the specific free enthalpy - or Gibbs free energy 
- is continuous. In simple cases the condition gives rise to the “common tangent 
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construction”, which in thermodynamics is a familial - way to determine the load 
and deformations of phase equilibria as functions of temperature. 

The usefulness of the knowledge about the continuity of the Eshelby tensor 
is strongly qualified as a universal tool by the need to know the generally non- 
uniform stress and strain fields adjacent to the phase boundary. The determination 
of such fields requires numerical calculations and is therefore restricted to specific 
cases. 

More often than sharp phase boundaries we observe a transition zone between 
the phases, in particular - between two solid phases. Such a zone is narrow but 
macroscopically noticeable. And while we cannot employ singular surface analysis 
in such cases, we may integrate over the surface of the zone and thus compare 
stresses and strains in the regions of the body adjacent to the transition zone. In 
simple cases of high symmetry this procedure can lead to a valiant of the “common 
tangent construction”. 

In recent years phase transitions in solids have become important, since struc- 
tural elements of machines are subjected to high temperatures and strong tensile 
forces which can affect the size and shape of inclusions. After the pioneering work 
of Eshelby [4] for elastic inclusions, the thermodynamic character of the Eshelby 
tensor was first recognized by Heidug and Lehner [5]. More recent references are 
the works by Truskinovsky [6], Gurtin [7] and Liu [8]. 

Under most circumstances the exploitation of the phase boundary conditions 
requires extensive numerical calculations and such calculations have been made 
by Schmidt [9] and R. Muller [10], who have calculated the shapes of inclusions in 
solids. 

The present work emphasizes the similarity of the Eshelby tensor with the 
free enthalpy, or Gibbs free energy of thermodynamics. After the derivation of 
the general form of the phase equilibrium conditions, we specialize to a liquid- 
vapour transition and to a solid-solid transition under shear. We also discuss the 
case when the transition does not occur on a sharp interface but in a narrow but 
smooth zone. 

The mathematical formulation of the coherency of the phases makes use of the 
displacement field, i.e., the position of material points relative to their positions at 
an initial time, or in a reference configuration. And the displacement gradient af- 
fects the Eshelby tensor. This fact has given rise in the literature to the ideas that the 
proper environment of the Eshelby tensor is a “configurational space”; see [11, 12], 
who have gone so far as to call for a “configurational physics”. Similar ideas about 
“configurational balances” have been proposed by Podio-Guidugli [13]. We see 
no need for all these and we firmly place the Eshelby tensor in “physical space”, 
where it belongs in our opinion, along with such classical subjects as stress, strain 
and free enthalpy. 

To keep arguments simple we have omitted to endow the phase boundary with 
properties of its own like surface stress or surface energy. The formulae needed for 
the consideration of such properties are available in [14] or [15]. 
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Figure 1. Sharp phase boundary. 

2. Equations of Balance 

Figure 1 shows a cross section of a sharp phase boundary that separates the phases 
(+) and (— ) and has the unit normal n, pointing from (— ) into (+). The velocity 
of the boundary is V, = V n , and the velocities of the phases on either side are \f. 

The conservation laws of mass, momentum and energy on the phase boundary 
read 

[p(v; - Vi)]m = 0, 

[pv j (Vj - Vi) - tji]rii = 0, 

p(e + ^v 2 j(v, - Vi) - tjiWj + q i 

They state that the fluxes of mass, momentum and energy in and out of the surface 
arc equal; the convective fluxes arc relative to the moving surface. Equations (2.1) 
employ the canonical letters for the physical quantities. Thus p is the mass density, 
v, the velocity, tjj the stress, e the specific internal energy, and q, the heat flux. 
Square brackets denote the difference of the bracketed quantity on the two sides 
such that [c] = c + — c~ holds. 

The balance of entropy states that the efflux of entropy from the surface is bigger 
than the influx. Thus, if we suppose that the efflux is on the (+)-side and the influx 
is on the (— )-side the balance reads 

pr)(\i - Vt) + y n t =a ^ 0 ; ( 2 . 2 ) 

i] denotes the specific entropy, T the absolute temperature and a is the surface 
density of entropy production. 

It is reasonable to assume that the temperature is continuous. Therefore \q, ]/?, 
may be eliminated between (2.1) 3 and (2.2). With i/r = e — T q as the specific free 
energy the result reads 

p(V + ^ y2 V v -L - v ) ~ tj'Vjfii 




( 2 . 1 ) 

ni = 0 . 




= -To ^ 0 , 



( 2 . 3 ) 
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where v_l is short for v,«,. We define the arithmetic mean value (c) = j (c + + c ) 
and use (2. l)i j2 as well as the identity 

[ab] = [■ a](b ) + (, a)[b ] (2.4) 

to rewrite (2.3) in the form 

mp(v±. - V ) - (iy i fi i > [v y ] ^ 0. (2.5) 

In words this means that the difference between efflux and influx of the free energy 
is smaller than or equal to the power of the mean stress vector (tj,n,) on the relative 
velocity [v,-]. 

An alternative form of (2.5) results if we decompose [v ; ] into its normal and 
tangential components according to 

2 

I Vy | = [vjHy + 2 >,„]t7, (2.6) 

a=l 

where r“ arc two orthonormal tangent vectors on the phase boundary such that 

r“r j = 8 a p holds; V||„ = Vy-r“ arc the corresponding tangential components of v ; . 
Thus we obtain from (2.5) 

* - itjiniTlj) ]p(y± - V ) - V(ty ( m ( r“)[V| |a ] ^ 0, (2.7) 

L P J ^ 

so that the power of the mean stress vector is split into three parts due to the normal 
and tangential components of (ty,«, ). 



3. Kinematic Compatibility 

Coherency of the phases implies that the displacement vector rq is continuous 
across the phase boundary, i.e., 

[Wy]=0. (3.1) 

(The displacement field «, (Xy, t) refers to a material particle which at time t is 
at the position xj, while at some prior time t 0 it was at we have n, (Xy, t) = 
Xi(x°j , t ) - x ( °.) 

The gradient duj/dxi and the time derivative duj/dt may suffer jumps but, 
because of (3.1), the jump of the gradient cannot have a tangential direction and the 
jump of the time derivative is related to the normal speed V of the phase boundary. 
We have 



ditj 1 , 

= cijUj and 

dxj 





= -V 



(3.2) 
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These conditions, where a, is an arbitrary vector on the phase boundary, arc called 
kinematic compatibility conditions (e.g., see [16, p. 503 ff]). The velocity is related 
to the derivatives of the displacement by 



duj duj ( 

Mi = [Vi , hence v,- = o, , 

3 t 1 dxj 1 V 



3 Uj\ 1 duj 
3 X; / 3 1 



(3.3) 



We recall the decomposition (2.6) of [v,-] into normal and tangential components 
and use (2. l)i , (2.4), (3.2) and (3.3) to determine these components in terms of the 
mass rate of the phase transition 



[vj = 




p(v± - V), 



[V||«] 



-n 




UiP(M _ l - V ). 



Bki 



(3.4) 



By (3.4) the jumps of all three components arc proportional to the mass rate 
p(v_l — V) of the phase transition. For the normal component the factor of pro- 
portionality is [1/p], while for the tangential components the factors arc complex 
expressions in terms of the densities and displacement gradients of the phases. In 
the sequel we simplify by introducing the abbreviation B ki as indicated in (3.4) 2 . 



4. Entropy Inequality, Speed of Phase Boundary, Equilibrium 

We eliminate [vy] between (3.4) and (2.5) and obtain an alternative form of the 
entropy inequality or free energy inequality, viz. 



(f - + X>,T")r ;b u 

S ' V ^ 

Eshelby tensor gu 



n,nip(Mj_ - V) ^ 0. 



(4.1) 



The bracketed quantity defines the Eshelby tensor; here we denote it by g ik in 
order to emphasize its kinship with the free enthalpy or Gibbs free energy, which 
we shall exhibit later and which is usually denoted by g. Thus (4.1) may be written 
in abbreviated form as 



\gi l \nin,p(y ± - V) ^ 0. (4.2) 

The left-hand side of (4.2) may be interpreted as a product of a thermodynamic 
force [§/, ]«;«,- - the discontinuity of the normal component of the Eshelby tensor, 
and a thermodynamic flux p(y±_ — V) - the mass rate of the phase transition. 

The inequality (4.2) is satisfied by setting 



P (v_l - V) = N[g ik ]nin k , 



(4.3) 




106 



G. BURATTI ET AL. 



where IV is a negative factor of proportionality. Thus the mass rate of the phase 
transition is proportional to the jump of the normal component of the Eshelby 
tensor. 

In particular, in equilibrium, when the phase transition comes to a standstill we 
must have 



lgik]\E>Vn k = 0. 



(4.4) 



The index E refers to equilibrium, where v_l = V holds. In that case (2. 1) 2 implies 



[tji]\ E ni = 0, hence (t ji )\ E n i = t^n «, 



(4.5) 



so that the stress vector t /; n, is continuous. We obtain from (4.4) and (4.1) 



[gik] I Enin k 




-OL -U D 
^ j 



YliYli — 0 , 

E 



(4.6) 



where B ki is defined in (3.4) 2 . We shall refer to this equation as the Eshelby 
condition for phase equilibrium. 

Given the continuity of the temperature across the phase boundary and the 
continuity of the stress vector and the continuity of the normal component of the 
Eshelby tensor we can determine the position of the phase boundary as a function 
of temperature. 

We proceed to study two special cases. 



5, Liquid-Vapour Phase Transition 

We envisage a situation as shown in Figure 2(a) with liquid and vapour of a fixed 
mass m and in a fixed volume V. And we ask for V { ~\V, T ), the volume of the 
liquid in equilibrium as a function of the total volume V and temperature T. 



V(1/P,T) 




(a) (b) 

Figure 2. Common tangent for liquid-vapour transition. 
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In equilibrium the stress of both liquid and vapor reduces to an isotropic pres- 
sure p, i.e., in (4.6) we have tp = —pSp- Therefore the stress vector tprii has no 
tangential component and (4.6) reads 




or 



P 



m 

[1 /pV 



(5.1) 



since, by (4.5), the pressure is continuous in this case. Equation (5. 1), which repre- 
sents the Eshelby condition in the the present case, expresses the continuity of the 
specific free enthalpy or Gibbs free energy. 

We recall from basic thermodynamics that p = — 3i/r/(3 1/p) holds in the liquid 
and the vapour. Therefore it follows from (5.1)2 that 



dxf/ 

W 



3i/f [x//\ 

Wp = Wi 



(5.2) 



holds, which implies the well-known “co mm on tangent construction”. This is a 
graphical method for determining the densities p ± (T) as functions of temperature. 
These densities or, equivalently, the specific volumes 1 /p ± (T) result as the abscis- 
sae of the points of contact of the tangent common to the two free energy functions 
i/r“(l/p, T) and 7/ + ( I / p . T ); see Figure 2. The pressure needed to maintain the 
volume V is given by the slope of the tangent; it depends on the temperature. 

We arc thus able to reach our objective and determine V i ~ ) as a function of T, 
given V. With m being the mass of the fluid we obtain 



V~(V, T) — V 



m / V — p + (T) 
P~(T) — p + (T) 



(5.3) 



Of course, this solution did require the knowledge of x// (+) (l /p,T) and 
i/d -) (l/p, T). A well-known example for which this knowledge is available is the 
van der Waals gas, where are given by the two convex parts of the free energy 
function. 



6. Solid-Solid Phase Transition under Shear 



We consider a situation as shown in Figure 3(a) where the upper plane of an elastic 
body is displaced by u,(H) = (U, 0, 0). The transformation strain between the 
phases (+) and (— ) is given by 



e <+> 

b 'J 



b ,J b lJ 



0 e 0 

£00 

0 0 0 



( 6 . 1 ) 



We ask for X(U, T), the position of the phase boundary in equilibrium as a func- 
tion of temperature, if U is given. 
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Figure 3. Common tangent construction for solid-solid transition in shear. 



We assume that both phases are linearly elastic isotropic bodies with equal 
Lame coefficients // and X. Therefore the stresses tjj in terms of the strains Sjj = 
\(duj/dxj + du//dxj ) are given by 

f<+ } = 2^ - Bij) + *4% , = 2l us^ + Xs^-% . (6.2) 

In both phases the equilibrium condition dt/j /dxj = 0 must be satisfied. It follows 
that only the 1 -components of the displacement fields arc nonzero. They read 



u\ +) = 



.(-) 



U X 

h 2e — 

H H 



(x 2 -H) + U, 



+ 2s 



X 

77 



- 1 



x 2 . 



(6.3) 



These fields satisfy the boundary values u\ +l (H) = U, Mj _, (0) = 0 and the jump 
conditions (3.1) and (4.5). It remains to exploit the condition (4.6) on the continuity 
of the Eshelby tensor. 

Obviously from (6.3) we have 

£ i 2 ’ = e 2t' ) = 2H^ U + 2sX ), 

= 4° = + 2 e(X - //)), (6.4) 

tlt ) = tn=^(U + 2e(X-H)), 

tl 

and p i+) = p (_) holds, since sn = 0. Without loss of generality we set r“ = 
(1 , 0, 0) and r" = (0, 0, 1) on the boundary. Therefore (4.6) reads [ i// +t\ iB\ 2 \ = 0 
or, with (cf. (3.4)) [5 12 ] = -(l/p)[9wi/9x 2 ], 



pty ~ hi 



9«i 

9x 2 _ 



= 0 or, by (6.1), (6.3), t ]2 



[2ei 2 ] ‘ 



(6.5) 
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Equation (6.5) i or, equivalently [px// — tn2sn\ = 0, represents the Eshelby 
condition in this case. Clearly it may be expressed as the continuity of the free 
enthalpy density appropriate to shear loading. 

We recall from I i near elasticity that the free energy densities of the phases arc 
given by 

pf w = pf ,+, (T ) + fi(4 +> - %)(4 +> - ~ e u) + x4 +)2 and 

. 2 ( 6 . 6 ) 

pV~ ] = Pf ( ~\T) + ne\j ) s\ j 1 + -4 )2 , 

so that t \2 = dpxjr / (32ei 2 ) holds; f (± HT ) arc the specific non-elastic, thermal 
parts of the free energy density. Therefore (6.5) may be written in the form 



3 pxfr + dpi// 

d2s\2 d2si2 



[pty~\ 

[2e n \ ’ 



(6.7) 



which implies the common tangent construction: for the determination of sf 2 (T) 
and h 2 (T); see Figure 3. 

We arc then able to calculate the position X of the interface from (6.4) i as 



X(U, T ) = 



2 H£ { +\T) - U 
2{e { + ) (T)-e\- ) (T)) 



(6.8) 



Given (/, the position changes with temperature. 



7, Extrapolation to Transition Zones Rather than Sharp Boundaries 

It is cleai - that simple and clear-cut considerations like those in Sections 5 and 6 
require uniform fields of stress and strain up to the phase boundary whose position 
may then be calculated as shown. This does not work very often. Usually the fields 
arc non-uniform and not known analytically. Also neither the position nor the shape 
of the phase boundary is known. In such cases extensive numerical calculations arc 
needed to obtain solutions inside the phases that arc coherent at the boundary and 
satisfy the jump conditions of thermodynamics. 

Such calculations have been made (e.g., see [9, 10]), and they have been used to 
determine the size and shape of precipitates in alloys. Actually such investigations 
arc close in scope to the motivation of Eshelby ’s original paper [4], in which the 
Eshelby tensor appeared first. 

Not wanting to enter such complex numerical studies we shall rest content to 
point out, in this section, certain simple cases in which stress and strain arc not 
uniform and yet definite simple answers can be found like those of the previous 
sections. 

Specifically we shall consider circumstances in which the non-uniformity of the 
stress and strain fields is confined to a narrow range, the transition zone between 
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Figure 4. Tensile and compressive rods. 



two phases. Typical examples arc shown in Figure 4: a cylindrical rod under tension 
undergoing a phase change that is accompanied by a change of cross section. And 
a rod under compression where the lateral expansion is constrained by a rigid pipe. 
In both cases there is a transition zone of non-uniform stress but outside that zone 
it is reasonable to assume uniformity. 

We apply the equations of balance of mass, momentum, energy and entropy 
to material volumes £2 whose surfaces 3 £2 arc indicated by the dashed lines in 
Figure 4 so that the cross-sections lie outside the transition zones. The surfaces 
move with the bodies and the mantle parts of 3 £2 contribute nothing except possibly 
heating. Therefore the balance equations read 

{pv,»,A} = 0, 

{pv ; v«n,A - tjifiiA} = 0, 

| P ( £ + ^ v2 ) v '"' A “ t j' n ' v J A J + Qi n i dA = 

{priVitiiA} + f ^ dA ^ 0. 

JdQ, 1 

The brackets denote differences between the two cross-sections, such that {c} = 
c + — c~ . Note that the cross-sections may be different in area A. In the case of 
the tensile rod density and stress vanish on the mantle, while in the case of the 
compressed rod the contributions of stress on the mantle cancel each other because 
of symmetry. 

We assume that T is uniform and eliminate the heating between (7. 1 ) 3 4 . Thus 
we obtain 
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or, by (2.4), which holds for the curly brackets as well as the square ones, 



{WpVjUjA - {t ji n. i A){Yj} sC 0. 



(7.3) 



In both of our examples v,- is normal to the cross-sections and, by (7. l)i , we 
may therefore write (with v,/i, = v_l) 



{vy} = 




n jpv± A 



so that (7.3) assumes the form 



(7.4) 



i jr 



( tjjUjnjA ) 

pA 



pv_i_A ^ 0. 



(7.5) 



As before, in Section 4, we argue from (7.5) that the mass rate p\'±A of the tran- 
sition is proportional to its factor in (7.5) with a negative factor of proportionality 
so as to satisfy the inequality. Thus with /, ; n,/7 ; 4 as the tensile or compressive 
force P we have 



pvj_A = N\x// - 

pA 



and in phase equilibrium 



(7.6) 



{ 



\J/ — 




= 0, 



(7.7) 



since, by (7.1) 2 , P is the same on the two cross-sections. 

The phase equilibrium condition (7.7) is formally idential to the case of the 
liquid-vapour condition, cf. (5 . 1 ) i . 

Note, however, that when A + and A~ are unequal - like in the tensile rod - 
it is not the stress P/A which is equal in the two phases but the load P. In such 
a case there is an interesting alternative form of (7.7) which results by extending 
numerator and denominator by the lengths L + and L~, see Figure 4. In this way 
with pAL — m we obtain 



4-/>-)=0 or P = M (7.8) 

l m J {/} 

where /* = L ± /m ± are the specific lengths of the phases. 

Along with P = (dx/s/dl ) | + = (d%ls/dl)\~ this gives rise to a common tangent 
construction for the free energies (/, T). Such a construction was described by 
Huo and Muller [17] in connection with a shape memory rod from a minimization 
of the free energy. 
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Abstract. In a few papers published in the 1940’s, the Italian mathematician Eugenio Frola ( 1 906 — 
1962) proposed an axiomatic formulation of the theory of linear elasticity as an autonomous branch 
of mathematics, and he sketched a program for the logical foundations of that mathematical theory. 
He gave only some general principles and a few hints, without obtaining any definite axiomatic 
structure in the sense of the fundamental work of W. Noll and C. Truesdell a few years later. 
Nevertheless, Frola’s attempt of finding rigorous axiomatic foundations of classical elasticity must 
be acknowledged as a very first step in the right direction. 
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To Clifford Truesdell, Master of Science and Life 



1. Introduction 

As it is well known, at the beginning of the XXth century the theory of elasticity 
was widely developed in Italy, to the point that Felix Klein claimed that “elasticity 
was the national question of Italians”. In 1935, in the annual report of the Ital- 
ian Society for the Progress in Science (SIPS), it was remarked that, in a rather 
unpleasant period for Mechanics in Italy, two fields were still flourishing: Hydro- 
dynamics and Elasticity. Eugenio Frola was paid of the scientific group working in 
linear elasticity and mechanics of structures and he showed a marked interest in 
generalizing or trying to generalize any of the many problems studied by himself 
or by his colleagues in different fields, even if he dealt mainly with the theory 
of elasticity. He was variously attracted by problems in engineering, structural 
mechanics, vibrations, physics, functional analysis, PDE, integral equations, and 
economics. His efforts were always directed not only to solving particular problems 
but also to finding more general formulations which could satisfy, even partially, 
his expectation of pointing out the rigorous logical structure that ought to underlie 
every theory. 
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In 1940 Frola wrote two papers, in which he sought to develop a systematic 
approach to nonlinear elasticity, starting from a linear model. In Section 3 we 
examine this attempt, first discussed in an article [1] that appeared in II Saggiatore, 
a journal devoted to the dissemination of science among cultivated people, then in 
a more technical paper published in the Proceedings of the Academy of Sciences 
of Torino [2], where he went deeper into the question. In Section 4 we turn our 
analysis to a paper which appeared in 1948 in Atti del Centro Studi Metodologici, 
a largely unknown scientific journal, where he went through the principles of the 
theory of elasticity and tried to state them in a rigorous way, to point out the basic 
ideas that were applied, consciously or unconsciously, in the treatises and papers 
on this subject. Indeed he realized that there was a need for a logical, rigorous 
mathematical formulation of the theory of elasticity, both linear and nonlinear, and 
he would have liked to pursue this task, but his diverging interests and his ill health 
stopped his researches. 

In this paper our goal is to present and comment on the ideas of Frola on this 
specific topic, which was disregarded by most of his contemporaries, but funda- 
mental to any modern theory of elasticity, or, more generally, of continua. 



2, Sketch of a Biography and Comments on Previous Works 

Eugenio Frola was born in Montanaro, a small village at the feet of the Alps, near 
Torino, on September 28, 1906. He obtained two degrees, one in Civil Engineering 
from the Politecnico of Torino in 1929, and the other in Mathematics from the 
University of Torino in 1933. Frola was a student of G. Albenga and G. Fubini, the 
first working in Mechanics, the second in Analysis with a taste for the applications 
to elasticity and mechanics (for a brief sketch of their activities see [3]). He became 
Assistant to Fubini and to F.G. Tricomi, an outstanding mathematician who worked 
in Analysis and is famous for the PDE hearing his name (see again [3]). After 
a few years Frola obtained a position as Professor at the Politecnico of Torino. 
On February 24, 1940, he was elected Member of the Academy of Sciences of 
Torino, one of the oldest Italian academies, founded in 1759 by J.F. Fagrange 
among others. Between 1932 and 1955 Frola published 37 papers, mostly in the 
Proceedings of the Academy of Sciences of Torino and the Accademia dei Fincei. 
Thereafter he suffered from a fatal illness, which lasted several years. He turned 
his mind to Buddhism and translated a couple of Buddhist books, the second one 
being published after his death, which happened on May 6, 1962. 

His severe health problems diverted him from research and he did not complete 
his program to build logical foundations for the theory of elasticity, thus prevent- 
ing him from obtaining the results and the recognition he deserved. According to 
the philosopher of science F. Geymonat, who edited in 1964 a book of Frola’s 
collected papers [4], Frola’s contributions to epistemology were highly significant, 
even more so than it would appeal - from his dense but few and schematic essays on 




E. FROLA: AN ATTEMPT TOWARDS AN AXIOMATIC THEORY OF ELASTICITY 



115 



foundations and methodology (see the introduction of the volume [4, pp. 7-33]). 
A brief analysis of the works of Frola can be found in [5]. 

Frola’s work is marked by some characteristic features that distinguish his ap- 
proach from that of the other Italian mechanicians of the same period. In his 
research he always tries to generalize known or new results, relaxing some hy- 
potheses or putting in evidence analogies with other disciplines. In the first period 
(1932-1940) his researches were devoted to satisfying these needs: for instance, 
Frola [6] tried to extend to dynamics the Colonnetti and Castigliano versions of 
Betti’s reciprocity theorem, even though it was D. Graft! who scored success in 
1939 and 1963 (see [3, pp. 414-417]). It is worth noting that in the paper [7] 
published in the Acta Pontificia in 1938, the abstract is in Latin as requested: 
“Auctor ostendit theorema Colonnetti circa systemata elasto-plastica nihil aliud 
esse nisi hypotheses ilia fundamentals de deformationis omnimodae congruentia. 
Docet etiam rectam quamdam elementariam rationem, qua opus nos est algorithmis 
minimizantibus uti”. Hence we can see that the use of Latin in scientific papers did 
not disappear completely, even in the XXth century, and the revival of this language 
in a couple of papers by C. Truesdell signified the continuation of a use inherited 
from great scientists of the past. In the same period Frola applied to elasticity 
techniques of quantum mechanics [8], namely: he used the Dirac delta function and 
infinite dimensional vector spaces to obtain the equation of elastodynamics from 
a microscopical approach. In other researches he made use of linear functionals 
to obtain integral equations as field equations of elasticity, where the influence of 
Volterra, who had been in Torino from 1893 to 1900, is evident. 



3. A First Step: Linear and Nonlinear Theory of Elasticity 

In 1940 two papers of different style but with the same objective appeared [1, 2], 
In these papers Frola explored the mathematical foundations of linear elasticity as 
this theory was established in his times. His puipose was to make clear which parts 
of lineal - elasticity would remain unchanged and which parts had to be modified if 
the nonlinear theory were to be developed. Clearly this is not a trivial problem. By 
now we understand that it would be much more correct to develop an exact theory 
first and then linearize in a mathematically rigorous way, but it was not natural 
at all in those years, when elasticity meant linear elasticity for almost everyone 
(and even nowadays one can find in textbooks of physics such an identification). 
The first paper was published in the first issue of the journal II Saggiatore, the 
puipose of which was to disseminate the more advanced and up-to-date results 
in sciences among scientists themselves with papers that would avoid technical 
complications but expose the main ideas. In the editorial board there were, among 
others, F.G. Tricomi (whom we have already mentioned in the preceding section) 
and G.C. Wick, an outstanding physicist. In this paper Frola explained his ideas 
in an almost colloquial way and, before tackling the main question, he gave a 
simple sketch of what he thought were the main differences between elasticity 
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and plasticity, by means of simple ideal experiments as follows: if small rods of 
steel and lead arc strained and their deformations during loading and unloading arc 
measured, the steel rod will recover the initial configuration, while the lead rod will 
“remember” the deformations undergone. Here Frola uses a rigorous language: “If 
we want to make use of the language of modern analysis, we would say that [the 
displacements in the lead rod] arc no more a function of the force, but a functional 
depending on the force, which is a function of time, a continuous functional in the 
sense that small variations of the law with which the force has changed in time 
cause small variations in the displacements” [1, p. 75]. The influence of the works 
of Vol terra, Pincherie and Amaldi on functionals is clear, but the idea of applying 
such techniques to plasticity, in this context, is due to Frola. 

Returning to elasticity, Frola realizes that elastic bodies can be subjected to 
the phenomena of buckling when the load exceeds some critical values and it is 
not contradictory to the assumption of elastic material, but it is a nonlinear - effect: 
“Nevertheless, even after the abrupt variation in the behavior [the buckling of an 
elastic bar - , authors’ remark], the body continues to be elastic (no more in the linear 
sense) [. . .]” [1, p. 78]. In fact, Frola was interested in the problem of buckling 
of structures not only for its many different applications, but also for a deeper 
analysis of the theory of elasticity. While his contemporaries devoted their attention 
to many detailed and particular - problems, which led to engineering and structural 
mechanics (see the comment by Truesdell and Noll in the final remarks below), 
he spent his efforts at a crucial point, which was not a feature only of buckling of 
rods, but a basic problem of the theory of elasticity: the correct formulation of a 
nonlinear - theory. He was aware of the paper of Trefftz on buckling [9], as we will 
show later, but his interest was differently orientated and he was not satisfied with 
the way this problem had been treated. 

Frola concludes that, for such bodies under such circumstances, the principle of 
superposition of the effects is no longer valid. On the other hand, at that time the 
experimentalists assumed that in elastic solids this principle held and, in order to 
be consistent with this experimental hypothesis, the linear - model must be assumed. 
Finally, to justify buckling effects the nonlinear - model must be used, and the main 
assumption to be rejected is the principle of superposition of the effects, and ex- 
periments must take into account this new fact. Then Frola distilled what he took 
to be the essence of linear - elasticity, as summarized in two axioms: 

1. The principle of invariance of the response in a sequence of subbodies, each 
one included in the preceding one, down to a body point: in the words of Frola, 
“discesa dal globale al locale”, which means descent from global to local. In 
other words, we can claim that an elastic body consists of elastic subbodies 
and, in the limit, of elastic points. 

2. The existence of a local elastic energy function as a positive definite quadratic 
form in the first derivatives of the displacement ( components ): “l’energia poten- 
ziale elastica locale sia in particolare una forma quadratica, definita positiva, 
nelle sole derivate prime dello spostamento, invariante per moti rigidi infinites- 
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imi [. . where the term “invariante” must be referred to the energy function 
(ibidem, p. 79). 

In fact. Axiom 2 is partially a consequence of Axiom 1 : if we assume the exis- 
tence of a global elastic potential energy, because of Axiom 1 we can obtain it as the 
“sum” of the elastic potentials of the parts which, we imagine, compose the body, in 
the limit even when these parts arc elastic points. Then we can define a local func- 
tion (the local elastic potential energy), which depends on local properties of the 
displacement, and obtain the total energy by integration over the body. Since Frola 
was mainly interested in static problems, he added the unnecessary assumption that 
the elastic potential be positive definite and used the virtual work principle (which 
is a more general assumption in mechanics) to build up an equilibrium theory for 
linearly elastic solids. 

He ends his paper with the following proposal: “Le basi e i fondamenti delle 
teorie non lineari, che permettono una visione piu ampia dei fenomeni elastici 
generali, potranno formare argomento di un ulteriore esame” [The basis and the 
foundations of nonlinear theories, which allow a broader view of general elastic 
phenomena, can be in the future a subject of further investigations] [1, p. 80]. 

The next step appeared the same year. In a longer and more technical paper [2], 
Frola proves that buckling phenomena cannot be included in the I i near theory of 
elasticity and the difficulty lies in the principle of superposition, which he calls here 
the first postulate. In fact, the same remark was already made in [1], but now the 
approach is more detailed and mathematically consistent. Again Frola begins from 
a critical analysis of the postulates of the “ordinaria” (in our language it means lin- 
eal - ) theory of elasticity, remarking that the very basic postulate is the supeiposition 
principle. It is a phenomenological principle, from which both a global and a local 
theory ensue: in the first case it is expressed by means of integral field equations, 
and in the second case by differential equations. In a global theory this principle 
means that the stress-strain relations are expressed with linear functionals and that 
the local equations thus obtained are not necessarily linear differential equations. 
On the other hand, Hooke’s law is a physical assumption typical of linear theories, 
requiring the algebraic lineality of the stress-strain relations. But, as Frola clearly 
proves, the two assumptions are not equivalent: neither global lineality implies 
Hooke’s law, nor does the vice versa hold obviously. It follows that the first pos- 
tulate can be saved, but must be generalized: a global principle of supeiposition is 
required. 

A second postulate is added: “I quadrati delle derivate prime degli spostamenti 
siano trascurabili rispetto alle derivate stesse” [the squares of the first derivatives 
of the displacements must be negligible with respect to the derivatives themselves] 
[2, p. 535]. 

A third postulate concerns the loads: in modem language it states that only 
dead loads are taken into consideration. At this point, almost incidentally, Frola 
remarks that, if we admit the existence of a strain energy function, the matrix 
of the coefficients in Hooke’s law must be symmetric and vice versa. This is a 
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consequence of Betti’s reciprocal theorem, as it was clearly proved by C. Truesdell 
[10] in the more general context of finite elasticity. (Truesdell proved that Betti’s 
reciprocal theorem is a sufficient and necessary condition for an elastic material to 
be hyperelastic.) 

Finally, Frola rephrases the axioms listed in the previous paper, which arc nec- 
essary to obtain a linear theory. In modern language they can be summarized in the 
following way: (i) the body is linearly hyperelastic, (ii) the displacement gradient 
is “small”, and (iii) the external loads arc dead loads. 

Returning to the problem of buckling of elastic structures, Frola proves that the 
first and third postulates arc not affected by this anomalous behavior, while the 
second postulate fails: experience shows, for instance, in the well-known case of 
the elastica, that we can have small deformations but large first derivatives. Hence 
Frola proposes to modify the strain-displacement relations, as follows: 



1 2 1 2 1 2 1 1 1 2 1 2 

S xx — e xx + 2 S xx g e xy + g ~ Z^zx^ "^2^ ~2~ ’ 



( 1 ) 



and analogous expressions for the other components of the strain, where e xx , e yy , 
e zz , e xy , e yz , e zx arc the classical linear strain components in a Cartesian coordinates 
system (x, y, z), (u, v, w) the displacement components and (p, q, r) the compo- 
nents of the local rotation. In fact, according to Frola’s notations, the s xx , s yy , . . . 
are the components of what we now call the Lagrangian strain tensor and the 
e xx , e xy , . . . , p, q, r can be written in terms of the displacement gradient H, 

namely as: e xx = E xx , e xy = 2 E xy p = —W yx , where E xx , . . . , W xy , ... are 

the components of: E = Sym H, W = Skw H. Assuming that the squares of the 
lineal - strain components and the mixed terms are negligible, while the squares of 
the rotations are admitted, one can obtain the nonlinear relations: 



£ X x — &xx T 2 t ”)i £yz — £yz tZA • • • • (2) 

By using the first axiom the local elastic potential energy density takes the form: 

W = Wq + (A + p)0(p~ + q~ + r") + — (A + p,){p 2 + q^ + r 2 ) 2 

- lAe xx P 2 + e yy q 2 + e zz r 2 + e xy pq + e yz qr + e zx pr ] , (3) 



where Wo is the quadratic elastic strain energy density used in the linear theory 
and 6 = (e xx + e yy + e zz ) the ordinary cubic dilatation. In a few final lines Frola 
states a variational principle by means of the virtual work theorem and refers to a 
forthcoming paper with applications and local field equations given explicitly. But 
the paper never appeared. 

Let us remark that this critical review of the foundations of elasticity was carried 
out by Frola in a totally independent way. Analogous researches were carried out 
at the same time only in the former Soviet Union and were revealed to western 
scientists through the treatise by Valentin Valentinovich Novozhilov (1910-1987) 
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when its English version appealed in 1953 [11]. The subject was then definitively 
cleared up in a general context by Truesdell and Toupin [12]. 

The different “weights” attributed to displacement gradients and local rotations 
became a common feature after the works of Naghdi and others [13, 14], where 
the concepts of small displacements and moderate rotations were introduced. This 
modern terminology was not used in [2], but Frola explicitly said that the defor- 
mation components e xx , e yy , e zz are of the first order and the squares of the local 
rotation components p 2 , q 2 , r 2 must be of the same order. In other words, we do not 
deal with infinitesimal deformations and finite rotations, as misunderstood by many 
people, but small rotations, with a different order of infinitesimality with respect to 
the deformations. The definition of Frola in [2] fits exactly with the definition given 
by Naghdi and Vongsarnpigoon [14, Section 4.3, p. 282]: “Given [a small strain ] 
E = O(so), a proper orthogonal tensor R is said to be a moderate rotation with 
respect to e Q if for any unit vector v, the vector [3 defined in (4.29) [ fi = Rv — v] 
satisfies: /? = 0 (£q / 2 ) as e —>■ 0” [14, p. 282], 

Nobody, but one, noticed this new idea of Frola. The reason is that the nonlinear 
theory of elasticity was introduced independently in the 1930’s by A. Signorini 
in Italy and by C. Murnaghan in the USA. The approach of Signorini (and Mur- 
naghan, but he was not known in Italy) is completely different from Frola’s. Sig- 
norini did not pay much attention to the logical foundations of his theory. He put his 
efforts in developing techniques useful in solving some problems he had in mind, 
which could not be treated by the linear model. Unfortunately Signorini chose 
the “dreadful notation” [C. Truesdell, verbatim ] of the homographies, imposed in 
Italy by C. Burali-Forti and T. Boggio, instead of tensor calculus. This awkw ard 
formalism and the fact that the papers were written in Italian made the work of 
Signorini almost unreadable outside Italy. Only C. Truesdell took the trouble to 
scrutinize it thoroughly, so that many of Signorini ’s results, clarified and cleansed, 
appeared in the treatise of Truesdell and Toupin [12], with the credits he deserved. 

The only one who noticed Frola’s results was Placido Cicala (1910-1996), a 
professor of civil engineering at the Politecnico of Torino. Cicala had just written 
a paper [15] in which he proposed to modify the classical equations of equilibrium 
of lineal - elasticity in order to allow critical and postcritical behavior in loaded 
structures, i.e., again the buckling problem. As pointed out by Frola [2, p. 532], 
this approach can lead to a family of contradictory theories “in quanto partenti 
da risultati acquisiti dalla teoria ordinaria dell’ elasticity che deve d’altra parte nel 
corso della ricerca essere negata nei suoi postulati” [because it is based on results 
obtained from the linear theory of elasticity, which, on the other hand, must be 
denied in its postulates during the same research], Frola means that authors like 
Cicala modified the linear theory by adding ad hoc terms in the strain-displacement 
relations and in the constitutive relations relative to some specific problem, and in 
this way they contradicted, without noticing it, the postulates of linear elasticity 
they were still using. Cicala [16], in a one -page footnote of a paper devoted to the 
nonlinear theory, rejected the above-mentioned critical remark of Frola, and tried 
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to prove with a simple numerical example that Frola’s deductions were wrong. 
Incidentally, Cicala provided evidence that Frola was aware of the paper of Tre- 
fftz [9]. In a footnote on page 95, Cicala remarks: “(')... This note represents an 
appreciable development of the theory whose foundations arc stated by Trefftz in 
his master paper . . and, on page 97, we find another footnote: “This [ variational] 
procedure is used by Trefftz in the paper quoted above. Frola in his future notes, 
will adopt, as he announces [ annuncia ], such method”. The Italian verb “ annuncia ” 
used in this remark means that probably Frola mentioned this paper to Cicala in 
some private discussion, because he never wrote such a sentence. 

Cicala deals with a rigid rotation around the "-axis of angle a, which produces 
the displacement: 

u = x (cos cr 
v = y(coso! 
w = 0. 

Assuming the approximations required by Frola, Cicala obtains the explicit 
form for the deformation components s xx : 

du 1/3 u dw\ 2 1 / dv du\ 2 

xx dx + 8 \3z dx ) 8 \3x dy) 

which is equal to the expression (2). By substitution of (4) in (5) Cicala finds the 
following value for the first deformation component: 

Sxx — —(cos a — l) 2 (6) 

and since “the rotation a can be any, it can even result = 2” (indeed, as Frola 
will remark: = —2). And this is in contradiction with the hypothesis of small 
deformations. 

In fact, as Frola himself wrote in [17], Cicala confused “infinitesimal” and 
“small” quantities: “the theory [. . .] is obviously a limit theory which intends to 
represent phenomena not encompassed in the classical [= linear ] theory, but it 
does not pretend to be either a theory of finite displacements or to interpret any 
elastic phenomenon”. Frola tried to explain that, when we deal with “infinitesimal” 
quantities, they must not necessarily be of the same order, and, in any case, the 
example is misleading. The same rotation inserted in the formula (3) of Cicala’s 
paper will produce the strain e xx = cos x — 1 and Cicala claims that this result is 
correct, because it is of the same order as the deformation. It is easy for Frola 
to argue that e xx is of order a 2 , while e xx is of order cr 4 ; hence the rotations 
arc of a different order of “smallness” with respect to the pure deformations, as 
postulated. 

Finally, one must not confuse “small” with “finite”: “the theory, . . . , does not 
pretend to be a theory of finite displacements .... All the first derivatives of the 
displacement components must be considered small, . . . not necessarily of the 



— 1) — y sina, 

— 1) + x sin cr, (4) 
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first order” [17, p. 259]. Moreover the criticism of Cicala does not apply to this 
model “because it acts on a field (finite displacements) which is not included in my 
theory” ( ibidem , p. 260). 

It has already been remarked that Frola’s point of view is consistent with the 
theory of small deformations and moderate rotations as developed by Naghdi and 
coworkers [13, 14], who surely did not know the papers of Frola. 

Then, having been provoked, Frola proved that the counterexample produced 
by Cicala could be easily adapted to the classical linear elasticity; moreover, he 
claimed that his own theory was not rigorous, but sufficient to interpret some phys- 
ical facts, and that he was not dealing with a finite displacement theory. Moreover, 
Frola showed that in Cicala’s paper [15] there were some “weak points”, not to say 
mistakes and “inesattezze alquanto pericolose” [quite dangerous inaccuracies], as, 
for example, the confusion of the modulus of a derivative with the derivative of the 
modulus, the use of wrong rules of approximation, and finally, the statement that “if 
a is very small, fya is very small too” [15, p. 213], which is misleading. In fact, “if 
a goes to zero, so does fya, but if a = 10 -4 (really small), ffa = 10 -1 / 2 ~ 0.315, 
which I would not say to be very small” [17, p. 262]. Cicala did not reply and the 
debate passed unnoticed in the Italian scientific community. 



4. The Methodology of Mathematics and the Last Paper (1948) 

At the end of World War II, Frola turned his mind to new perspectives: he was 
mainly, if not exclusively, interested in problems related to the foundations and 
methodology of mathematics. This change must surely be connected with the es- 
tablishment of the “Centro Studi Metodologici di Torino”, whose founders were 
prominent scientists working at the University and at the Politecnico of Torino: 
mathematicians, physicists, chemists, economists, etc. It was a smaller version of 
the Vienna Circle: a private association of scholars, linked by a common interest in 
logical foundations of science. Frola was very active, from the beginning, even if 
he was interested almost only in mathematics and its applications and did not seem 
to be too much influenced by the Austrian neopositivism. On the history of the 
Centro the reader is referred to a very complete article by Giacardi and Roero [19]. 
Included in the above mentioned book [4] (edited by Geymonat and published in 
1964) arc Frola’s main papers of this period, all concerned with mathematics and 
its relationship with other sciences, including engineering, physics and economics. 
This last subject is discussed in detail in his most celebrated methodological paper, 
written with the economist Leoni [20]. Here Frola strongly denies that it would be 
possible to apply mathematics to economics in the same way that it is applied to 
physics. He shows that other mathematical techniques arc needed, most of them 
not even known to the economists of that period. Here we have no intention to go 
into the details of this paper, which contains many interesting critical comments 
and suggestions, but it is worth noting that at the beginning criticism is levelled to 
those who blindly use mathematics without caution: “Writing down a system of 
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mathematical relations is useless [. . .] and meaningless, if theorems of existence of 
the solutions arc not proved and techniques of approximations arc not determinated 
[...]. This solution will be considered satisfactory, and the system of mathematical 
relations efficient, if it [. . .] will provide us a good accuracy” [20, p. 88], At the 
end of the same paper Frola attacks again the improper use of mathematics and 
quotes a book by P.A. Samuelson, Foundations of Economic Analysis (Cambridge, 
1948), where, on page 10, a system of n equations involving some not well de- 
fined “functional relationships” in n unknowns and m parameters is considered. 
Samuelson deduces that any of the unknowns can be expressed as a function of the 
parameters. After some sharp considerations, Frola concludes: “[. . .] we cannot 
but regal'd this approach as a free and easy use of mathematics. One could find 
countless examples of such uninhibited use of mathematics in economics, which 
are not considered here [. . .]” [20, p. 109]. 

Pursuing his new interest in the philosophy of mathematics, in 1948 Frola wrote 
[18] a paper on the logical foundations of the theory of elasticity, where he does 
not enter into the details of an axiomatic structure, but simply tries to fix the funda- 
mental concepts of the theory. It is sort of a first draught, which was never followed 
by a more complete work. Its relevance is due to the fact that almost certainly it is 
the first attempt, together with the papers of 1940-1942, to formalize some basic 
axioms of the theory of elasticity. 

Frola makes a distinction amongst the characteristics of the axioms, dividing 
them into three families. A similar distinction in different groups of hypotheses 
can be found in the general and classical works of W. Noll and C. Truesdell, even 
if, obviously, the classification of Frola is simpler, more schematic, and even naive. 
In Frola’s scheme, there are three groups of axioms, each one containing three 
hypotheses. The first group contains “geometrical” assumptions: 

1. An elastic body is a domain in an Euclidean 3-dimensional space (what we can 
call a “placement” of the body). 

2. A vector field, representing the deformations, is defined over this domain. 

3. This vector field must satisfy suitable smoothness conditions. 

The second group contains the “mechanical” assumptions: 

4. The loads are described by vectors, with the same properties of forces in rational 
mechanics. 

5. “On the existence of tensions”: internal tractions as surface forces are defined 
by the assumption that a surface force acts on the boundary of any subbody and 
represents the action of the remaining part of the body over the given subbody. 

6. The nature of loads and tensions must be defined in a precise, mathematical 
way, including hypotheses analogous to those required in point 3. 

It is cleai' that such a scheme presumes a more general theoretical framework, 
where the forces (i.e., volume and surface forces) must be introduced according 
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to some general theory of rational mechanics. The third group is called “physical 
hypotheses” and concerns more specifically 1 i near elasticity: 

7. Lineality: there holds the superposition principle, as introduced in [1], 

8. The displacements arc small, in the usual sense. 

9. “[DJiscesa alTinfinitesimo”: all the preceding assumptions arc valid for any 

subbody of the body, down to the smallest part of the body, namely a point, 

taken as the limit of a sequence of shrinking domains ordered by inclusion. 

This final assumption, of a nature analogous to assumption 5, neither has the 
support of experimental evidence nor represents any physical experience, but is 
the result of a “metaphysical attitude” necessary to apply the macroscopic con- 
tinuous model, which allows the derivation of differential equations as field equa- 
tions. 

In his final remark, Frola expresses his dissatisfaction with this inadequate con- 
struction. He realizes that it is not the only possible approach, and the mathematical 
language used here is not even the best mathematical language one can imagine. 
Surely there are gaps to be filled, and he hints vaguely that he would continue his 
researches in this direction. 

But Frola soon turned his mind away from elasticity and mathematics. He wrote 
a few papers about methodology of mathematics, including the aforementioned 
one on economics, but he never continued his interesting program on axiomati- 
zation of the theory of elasticity and became wholly immersed in his studies on 
Buddhism. 

In conclusion Frola’s ideas, even if his construction has not been completed, 
remain exemplary for their consistency and coherence, and they stimulate fur- 
ther work to search for rigorous logical foundations of theories, for physically 
meaningful hypotheses, and for correct applications in different fields. 

The last lines of this paper [18, p. 14] arc a sketch of a program: “Rimane 
quindi allettante, e dal punto di vista critico e da quello applicative della scienza 
delle costruzioni, il tentare altre vie per la costruzione di nuovi complessi teorici 
destinati a descrivere scientificamente il fenomeno elastico”. [It remains hence ap- 
pealing, both from critical and applicative points of view in materials science, to try 
different ways to build up new theoretical structures apt to describe scientifically 
the elastic phenomenon.] 

Frola failed, but this task would be successfully fulfilled a few years later by 
W. Noll, C. Truesdell and others, in complete generality, not only for elastic ma- 
terial, but also for a broad class of materials, within the framework of Rational 
Continuum Mechanics. 



5, Final Remarks 

Even if Frola did not obtain the results he pursued, he felt the need of consti- 
tutive equations for the nonlinear - theory of elasticity, and by extension for other 
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theories based on a mathematical model. In this sense he realized beforehand the 
significance of one of the basic points in Rational Continuum Mechanics. The con- 
stitutive equations, correctly formulated, arc fundamental in providing a rigorous 
framework and the logical foundations of a physical-mathematical theory. They arc 
based on different sets of axioms, as Frola explicitly said, and his classification is 
very similar, even if much sketchier, to the classification used nowadays. 

The historical impact of Frola’s papers is slight, because they were few, incom- 
plete and unknown, but it is surprising that he, isolated and pathetic figure of a 
scientist, working outside the main stream of interest of his time, tried to find the 
logical structure of the theory of elasticity, both linear and nonlinear. 

The main interests in Italian mechanics between the 1930's and the 1950’s 
were surely not to build up general theories of continua, but to solve many par- 
ticular problems, applying analytical tools, often complicated and sophisticated, 
with the goal of proving existence, uniqueness, and regularity of the solutions of 
various differential equations. In other countries different models were developed, 
mainly linear, to study phenomena such as plasticity, dynamics of fluids, structural 
mechanics, etc., and again many particular' problems were solved, many results 
were found, sometimes interesting, but “most of which have later turned out to be 
unnecessary in the cases they arc justified. Knowledge of the true principles of the 
general theory seems to have diminished except in Italy, where it was kept alive 
by the teaching and writing of Signorini” [21, p. 9]. Nevertheless the relevant work 
done by Signorini, one of the pioneers, with Murnaghan, on the nonlinear - theory of 
elasticity, was ignored until C. Truesdell read his difficult papers (written in Italian) 
and noticed their significance. 

Frola did not follow the lead of Signorini. His approach was completely dif- 
ferent: his goal was to find the logical structure at the basis of a theory in applied 
mathematics and, even if he solved some particular - problems in elasticity, he was 
aware that the theory of elasticity, as well as other theories, had weak foundations 
and it was necessary to reconstruct them firmly. 

There are no important theorems in his papers, but we would like to point out 
that Frola tried to go deeper into the logical foundations of linear - and nonlinear 
elasticity, while his contemporaries were not interested in it. He was not able to set 
the theory of elasticity on a satisfactory basis, but we must recognize that his was 
one of the very first attempts in this direction. 
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In memory of Clifford Ambrose Truesdell, our teacher and mentor 



1. Lagrangian and Hamiltonian Descriptions of Elastic Bodies with 
Substructure 

In standard continuum mechanics, each material element of a body is “collapsed” 
into the place occupied by its centre of mass; let X be that place in the reference 
placement; the set of all X is taken to be a fit region fBo of the three-dimensional 
Euclidean space 8. 

Sometimes such simplicistic model of physical reality is insufficient; then, to 
render the picture adequate, the material element must be portrayed as a system and 
at least some coarse grained descriptor v (an order parameter) enters the picture. 

Here, as in [1] (see for other details and additional results [2-5]), we take v as 
an element of some differentiable manifold M, and presume that physical circum- 
stances impose a single choice of metric and of connection for M. 

We also assume that the region occupied by the body in the current placement 
be obtained through a sufficiently smooth mapping x: f8o — > 8\ so that the current 
place of a material element at X in 33 o is given by x = x(X); and 33 = x(33q) 
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is also fit. We denote as usual with F the placement gradient. We presume also 
that another sufficiently smooth mapping v: JBq — > M shows the value of the order 
parameter at X, namely v = v(X). A motion is a pair of time -parametrized families 
x,(X) = x(X, t) and \>, (X)) = v(X, t), twice differentiable with respect to time. 
Rates in the material representation are indicated with x(X, t) and v(X, t ), and we 
will write x and v for brevity. 

We restrict here our attention to bodies for which a Lagrangian density X exists, 
so that the total Lagrangian L of the body is given by 

Ls 0 = I X(X,x,x,F,v,v,Vv)d(vol), (1) 

0 

the gradient V v being based on the mandatory connection.* We presume that X be 
of the form 

=C(X, x, x, F, v, v, Vv) = ^p 0 ||x[| 2 + pox(v, v) — Poe{X, F, v, Vv) 

- p Q w(x. v), (2) 

where p Q is the referential mass density (conserved during the motion), x the kinet- 
ic co-energy (see [1, p. 19]) associated with the substructure, e the elastic energy 
density and w the density of the potential of external actions, all per unit mass. 
Below we use the notation b = — d x w for the density standard external actions 
and /3 = —3 v w for the substructural ones. The kinetic energy density Pqk(v, v) 
pertaining to the substructure is the partial Legendre transform of / with respect 
to v. 

If X is sufficiently smooth, we may apply standard procedures to derive Euler- 



Lagrange equations for the functional L <8n : 

3 xX = d x X — Div 3f=C, (3) 

d{,X = d v X — Divds/ v X, (4) 

where Div is the divergence calculated with respect to X, i.e. Div = trV. 

Put 

5) = x • d x X + i’ ■ d{,X — X. (5) 

* The pair (v, Vv) collects the peculiar elements of the tangent mapping Tv\ SSq x Vec -*■ T M, 



where Vec is the translation space over 6, and we identify £ o x Vec with T £q. More specifically, we 
have rf>(X): Txi S q — >■ T V M. Such elements cannot be separated invariantly unless M is endowed 
with a parallelism (and one wants also to have a physically significant parallelism). A similar remark 
holds also for each element (v, v) of T M. The Lagrangian density is then defined on 

■So x 8 x Vec x Hom(Vec, Vec) x TM x Hom(T £ q, T M) 

(satisfying some compatibility conditions of possible various nature, depending on the substructure), 
with Hom(A, B) the set of linear transformations between A and B. 
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Clearly, Sj is the density of the total energy. In fact, since d;,X = podoX> the term 
v ■ d;,x - / in (5) coincides with the substructural kinetic energy density k(v, v) 
(hence the presence of x rather than k in the expression of X), then 

1 • 9 

Sj = -pollxll + Pqk(v , v) + p 0 e(X, F, v, Vu) + p 0 w(x, v), (6) 

as asserted.* 

The balance of energy can be expressed in terms of S) as follows 

h - D/v(xP + v-S) = 0, (7) 

where P and S arc respectively the Piola-Kirchhoff stress and the referential mi- 
crostress 

P = — 9f=C, $ = — ds7 V X. (8) 

That (7) is true follows from direct computation. Notice that, in view of our 
hypothesis on the existence of a unique, physically significant connection for M, 
concrete meaning can be assured for microtractions Sn which represent inter- 
actions between neighboring material elements. As already remarked in various 
occasions in [1] (see, e.g., pp. 26, 27), in general, a properly covariant separation 
between ‘self-actions’ (— d v X ) and ‘microstresses’ (— dy v £) does not apply. 

Equations (3) and (4) lead us to an appropriate version of Noether theorem 
(see [4, p. 29]); here we follow the program of [6, p. 284]. We consider some virtual 
motion of our system, by assigning two one-parameter families f ' of sufficiently 
smooth point valued diffeomorphisms, i = 1,2, acting respectively on So and 8, 
and a Lie group G of transformations of M. We indicate with a prime the derivative 
with respect to the relevant s. 

1. At each s\, fj acts on So so that X i — > f' (X) e 8 , and is isocoric (no virtual 
change of density), i.e., Div fj' = 0; f', is the identity. We put fJ'(X) = w. 

2. At each ,v 2 . f ” is a diffeomorphism that transforms 8 into itself. We assume that 
fg is the identity and put fj'(x) = v. 

3. A Lie group G, containing 50(3), acts on M. Let £ be an element of the Lie 
algebra of G. The infinitesimal generator of its action on v e M is indicated 
with fu( v ) (see [7, p. 256]); v g is the value of v after the action of g e G. If we 
consider a one -parameter trajectory ,v 3 g Si e G such that is the identity, 

* We do not identify a-priori the substructural kinetic co-energy x with its Legendre transform k 
(assuming thus for both the traditional quadratic form), to encompass cases in which such a distinc- 
tion is necessary to capture prominent physical phenomena. Excluding for a while exotic cases, in 
fact, even when k(v, v) = (l/2)v“£2 Q , / g v^, with f2(v) e Sym + (T V M, T V M), a priori / differs from 
k by an addendum of the type X a v a , with A(v) e T*M. Although such an addendum is commonly 
neglected, it becomes prominent in some cases as, e.g., when the substructural kinetics has rotational 
character, i.e., when the anholomic constraint v = Am holds, with m an arbitrary vector. Such a 
circumstance occurs in magnetostrictive solids for which m is the magnetization and one obtains 
from (4) the standard Gilbert's equation (see [13] for the relevant calculations). 
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we have also s 3 i— v gs ^ and ^(v) = (d/ds 3 )v ?J3 | J3=0 . When G coincides with 
the special orthogonal group SO(3), we identify r>) with .yfq, where q is 
the characteristic vector of a rotational rigid velocity and A a linear operator 
mapping vectors into elements of the tangent space of M. namely, if v q is the 
value of the order parameter measured by an observer after a rotation q, then 
<A> = (dv q /dq)| q=0 . 

Henceforth, to simplify notations, we use f 1 , f 2 and v g to indicate fj (X), f 2 (x), 
v gs (X), and write | 0 for | s , =o.s-t=o..s 3 =o- Moreover, grad indicates the gradient with 
respect to x. 

We say that X is invariant with respect to fj ’s and G when 
X(X, x, x, F, v, v, Vv), 

= ^(f 1 , f 2 , (grad f 2 )x, (grad f 2 )F( Vf 1 )“ 1 , v g , v g , (V^XVf 1 ) -1 ). (9) 

Let us define 

Q. = dxX ■ (v — Fw) + di,X ■ ($ M (v) - (Vv)w), (10) 

d=Xw+ (3 F c£) t (v - Fw) + (dy v X) J (f M (v) - (Vv)w), (11) 

where v, w and ^(v) arc as mentioned in items 1, 2, 3. 

THEOREM 1 (Noether-like theorem for complex materials). If the Lagrangian 
density X is invariant under f] , f 2 ^ and G, then 

a + Divd = 0. (12) 

Proof. To prove the theorem, as a first step we note that (9) implies 

^(f 1 , f 2 , (gradf)x, (grad f 2 )F(Vf 1 ) -1 , v g , v g , (Vv g )(Vf l )~% = 0, 

(13) 

(f 1 , f 2 , (grad f 2 )x, (grar/f 2 )F(Vf 1 )“ 1 , v g , v g , (Vv ? )(Vf 1 ) -1 ) | 0 = 0, 

CIS 2 

(14) 

f 2 , (grad f 2 )x, (grad f 2 )F(Vf') _1 , v e , v g , (Vv g )(Vf l )~ l ) | 0 = 0, 

Q53 

(15) 

which lead to 

d x X ■ w — d F X ■ (FVw) — d Vv X ■ (VvVw) = 0, (16) 

d x X ■ v + <hX ■ ((grad v)x) + cf-X ■ ((grad v)F) = 0, (17) 

d v X ■ £,m( v ) + Sr X ■ %' M (v) + dy v X ■ V^, w (v) = 0, (18) 
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as a consequence of the properties listed under items 1-3 above. Then, we calculate 
the time rate of the scalar O, the divergence of the vector f and, by using the 
equations (3) and (4), identifying s 3 with t, we recognize that 

d | d . d . 

62 + Div^S — - — X 0 + — — X 0 + - — =£ 0 , (19) 

dsi u d5 2 ds 3 u 

which proves the theorem. □ 

REMARK 1. Asa first special case, we require that alone acts on X leaving v 
arbitrary. By using (17) we obtain from (12) 

- 9 X *C + Divd F X^j v = 0, (20) 



i.e., 



Pox = p 0 b + Div P, (21) 

which is the standard equation of balance of momentum. 

REMARK 2. With G arbitrary, we consider its action alone on X: by using (18), 
with the identification .v 3 = t, we obtain from (12) that 

— dv£ + DivdvvX'j ■ ^m( v ) — 0 ( 22 ) 

or 

Po(3t-X - 3vX) + z - PoP - Div# = 0. (23) 

z = —p 0 d v e is called self-force in the terminology of [1], This result assures the 
covariance of the balance of substructural interactions. When G coincides with 
50(3), the co-vector in the parentheses in (22), namely the term multiplying 
must be an element of the null space of ,A r (see for details of this special case 

[1-4]). 

REMARK 3. As a second special choice, let be such that v = q x (x — x 0 ) 
(with q a rigid rotational velocity - depending on time only - and x 0 a fixed point 
in space) and G = 50(3); qx is an element of its Lie algebra, thus f.uf 1 ’) = -Aq. If 
X is independent of x and we assume that only fy and G (in the form just defined) 
act on X, we have 

slew (3f=CF t ) = e(A T d v X + (V2t T ) f 9 Vl ,»C), (24) 

where e is Ricci’s alternating tensor and skw(-) extracts the skew-symmetric paid 
of its argument. 
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REMARK 4. If we require that f] alone acts on X, with w arbitrary (but satisfy- 
ing 1), by using (16) we obtain from (12) that 

(¥ T d k X + V v J di,X) - D/v(p - Qa,||x|| 2 + p oX (v, i>))l) - d x X = 0, 

(25) 

where P = p 0 el — F T P — V v T *& is the modified Eshelby tensor for continua 
with substructure (see [4] for a si mi lar result in a non-conservative setting, where 
the elastic potential e is substituted by the free energy). I is the second-order unit 
tensor and, in writing the explicit expression of P above, we find convenient to 
introduce the product * defined by (Vv T *A)n u = 4n (Vv)u for any pair of 
vectors n and u. 

REMARK 5. Let us assume as special choices that fj is such that w = q x 
(X — X 0 ) (with q a rigid rotational velocity, and X 0 a fixed point in space) and 
that G = SO(3), being qx an element of its Lie algebra, thus §, w (v) = Aq. If 
the material is homogeneous, and we assume that f 1 and G alone (in the form just 
defined) act on X, we have 57 tv(F t 3 Fo C + (Vv) T *3 Vv ,dC) = 0. 

REMARK 6. The action of can be interpreted as a special virtual mutation of 
a possibly existing smooth distribution of inhomogeneities throughout the body, in 
the sense of [8], In other words, we may say that (25) is the balance of interactions 
arising when the body mutates its inhomogeneous structure. This interpretation has 
been also suggested in [9] in non-conservative setting. 



1.1. HAMILTON EQUATIONS 

Define p and fi, respectively, the canonical momentum and the canonical substruc- 
tural momentum , by 

p = diX, pi = di,X. (26) 

The Hamiltonian density M, 

JC(X, x, p, F, v, ii, Vv) = p • x + ii ■ v — X(X, x, x, F, v, v, Vu), (27) 



has partial derivatives with respect to its entries; some of them are the opposite of 
the corresponding derivatives of X so that (3), (4) can be also written respectively 
as 



p = -d x M + Divd ¥ M, 
x = 3 P JC; 

fi = —d v M + Divdy v M, 



(28) 



v 



(29) 
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2. Canonical Poisson Brackets in Multifield Theories 



We now consider a general boundary value problem where the following boundary 
conditions arc associated with (28) and (29) 



x(X) = x 


on 3 (x) l8o, 


(30) 


d^Mn = t 


on 3 (t) <So- 


(31) 


v(X) = v 


on 3 (v) <S 0 , 


(32) 


= t 


on 3 ft) 28 0 : 


(33) 



x, t, v and t arc prescribed on the relevant parts 3 ' ' ’ <S 0 of the boundary, C/(3<So) = 
CZ( 3 (x) <S 0 U 3 (t) <S 0 ), with 9 (X) 2S 0 n d (t) & 0 = 0, and Cl(di B 0 ) = CZ(9 (v, <SB 0 U 
3 (t) l8o), with fl 3 a, 43o = 0, where Cl indicates closure and n is the outward 

unit normal to 3<33 0 at all points in which it is well defined. 

Again, as below (8) we argue that physical significance can be attributed to (33) 
in view on our hypotheses on M. It should be clear that those hypotheses need not 
apply to all substructures, e.g., in “homogenized” theories of liquids containing gas 
bubbles (order parameter the gas fraction) no such microstress could be expected 
to have such physical substance; there the interactions are at least weakly nonlocal, 
and no significant connection seems to exists then for M. 

We assume that there exist two surface densities U (x) and U (v) such that 



t = p 0 d x U, t = p Q d v U, 



(34) 



where U and U plays here the role of surface potentials. 
Then the Hamiltonian H of the whole body is given by 

H(x, p, v, pt) = / J{(X, x, p, v, pf d(vol) 

J Bo 

r 

( U (x) — U (v)) d (area). 



f ( 

Jd (t) B 0 



(35) 



Notice that we write (X, x, p, v, ji) instead of Jf(X, x, p, F, v, /x, Vv) because 
below we consider directly variational derivatives. 



THEOREM 2. The canonical Hamilton equation 
F = {F, H} 



(36) 



is equivalent to the Hamiltonian system of balance equations (28), (29) for a con- 
tinuum with substructure where F is any functional of the type /(X, x, p, v, pf, 

with f a sufficiently smooth scalar density, and the Poisson bracket {•, •} for a 
complex material is given by 



{F,H} 




8M 

3p 



8M 

<5x 




d(voZ) 
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f (Sf SM 

J d m So \Sx <5p 
r ( 8f 8M 8M 8f\ 

+ JSo\ 8v S p S p 8v ) 
f (Sf 8M 

JdW£ 0 \ 8v S/l 



8M 8f 
8x <5p 



I d (area) 

d (t) B 0 ) 



d(vo/) 



8M 8f 
8v 8p, 



3 (t 'Sr 



d (area), 



(37) 



where the variational derivative 8M/8x is obtained fixing p and allowing x to 
vary;* an analogous meaning is valid for the variational derivative with respect to 
the order parameter. 

The proof can be developed by direct calculation. Clearly, {• , -} is bilinear and 
skew-symmetric, and one can check easily that it satisfies the Jacobi ’s identity. We 
note that 



d(vol) 



f (8f dM 8f 

{F,H} = / l — (d x M - DivdyM) 

J So \8x 9p <5p 

+ f (y-y- -y- ^ - 9 F ^n)| ) 

J d mg 0 \8x 9p 3 ( t )JBo 9p ld 

f ( 8f dM Sf 

+ / ( • -r j— • (d v M 

J So \8v dp. 8p . 

+ f ( 8 ±. d JH 

Jd (t) £ 0 \Sv dp 



d (area) 



Divd\/ V M) Jd(voZ) 



3«<S 0 

Sf 

j— ■ Cf,U - 

op 



ds/ v Mn)\ dWSo jd{area) 

and, in terms of functional partial derivatives, 

f (Sf . 8f . Sf Sf \ 

F = / — • x + — • p + -L.v + — -ii Id (vol) 

Jsq \ Sx <5p 8v 8p f 

Sf 



(38) 



+ / — 



S_f 
I d (t) £ 0 8x 






d (area) + — ■ v 

J d 1 



IdWjBo Sv 



d(area). 



(39) 



3 «<B 0 



By identifying analogous terms in (38) and (39), we obtain both the Hamiltonian 
system (28), (29) and the boundary conditions (30)-(33). 

When we put F = H, (36) coincides with the equation of conservation of 
energy. We have, in fact. 



H = {H, H] = 0. 



(40) 



Geometrical properties of the Poisson brackets for direct models of rods, plates 
and complex fluids have been discussed in [10, 11], 



See relevant remarks in [10]. 
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3. A Formal Approach toward an Hamilton-Jacobi Theory with Gradient 
Effects 

Let h be a smooth diffeomorphism 

h: (X, x, p, F, v, ijl, Vv) i — > (X, x*, p*, F*, v*, /x*, Vv*). (41) 

The transformation h generates a new Hamiltonian density 

<7f*(X, x*, p*, F*, v*, /x*, Vv*), (42) 

with coiTesponding Lagrangian density 

£* — p* • x* /x* • v* eTf* . (43) 

If h were such that M i: = 0, then an immediate integration of the system (28), 
(29) could be achieved. To this aim we choose h to be such that the integral of 
the difference £ — £* between two instants, say t\ and t 2 , be equal to the time 
derivative of a generating function 5 of the type 5 = 5(1, X, x, p*, v, /x*), i.e., 

f\x-x,)dr = S\ l=t2 -S\ t=tr (44) 

Jh 

Then, from (44) we would have 

(p • x + /X • v - M) - (p* • x* + /X* • v* - Jf*) 

— S — 3,5 + 3 X S ■ x + 3 Pt 5 • p* + d v S ■ i’ + (45) 

and hence 



p = 3 X S, /x = 3 V S, (46) 

x*-x 0 = 3 P<: 5, v = 3^5, (47) 

3 ,S + M = M*. (48) 

To obtain (47) one makes use of the fact that 8 p • (x — x 0 ) + /x • v) 
= 0 for variations vanishing at t\ and t 2 (in the sense that 8(/i ■ v)\ t=t]j2 = 0 
and <5(p • (x — x 0 ))| t=t\,t 2 = 0) so l h at p • x = p • (x — x 0 ) and /x • v = ft ■ v. 

A necessary and sufficient condition to assure that -R t = 0 is 

3 ,S + Jf(X, x, 3 X S, F, v, 3,5, Vv) = 0, (49) 

which is a Hamiltonian-Jacobi like equation. Since M i: = 0, p* and /x* arc con- 
stant in time, the time derivative of 5 reduces to 

5 = 3,5 + 3 x 5 • x + 3,5 • v = —M + p • x + /x • v = X. (50) 

The relation (50) allows us to determine 5 to within a constant, namely 



5 = J Xdt + const. 



(51) 
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4 . The Spatial Form 

Circumstances in which the notion of reference placement is wanting, as in the 
case of fluids or granular flows, render the choice of a material or spatial represen- 
tation not matter of form only (see, e.g., [12] for standard bodies). Here, having 
in mind the study of complex fluids, we provide a spatial variational derivation of 
the balance equations free of any concept of reference place or paragon setting * 
and without even formal recourse to an inverse motion. So, in the present section 
x e 33 is just a point in space. The notation u = u(x, t ) is used for the velocity field 
over IB. The order parameter is now v = v(x, t ) (with some abuse of notation) and 
we indicate with v = u(x, t) its rate in the present placement. The symmetric 
tensor g is the spatial metric characterizing the present state of the body; it plays a 
prominent role because in this case the counterpart of (2) of the Lagrangian density 
is of the form 

1 , 

<£(x, u, g, v, v, gradv) = -p||v||“ + v ) ~ pc(g, v, gradv) 

— pw(x, v), (52) 



with some slight abuse of notation. We then find balance equations as conditions 
verifying the relation 




X(x, u, g, v, v, gradv) d (vol) 



= 0 , 



(53) 



where <5 denotes the total variation (where we use perhaps inappropriately the ad- 
jective “total” in the sense which is sometimes accepted in elementary treatises 
when speacking of total time derivatives, as it will be clear in the developments 
below). 

To define the variation of the relevant fields, we make use of f 2 introduced at 
point 2 of Section 1 and identify <5x with v. We consider a special (though wide) 
subclass of possible vector fields x i — > v(x) characterized by the circumstance that 
they arc purely deformative; in other words, we choose v such that skewgrad v = 0. 

We then define 

<5g = ^f^g|, 2=0 = Tv g = 2 symgrad v = 2 grad v, (54) 

where f 2 * means pull back and L v is thus the autonomous Lie derivative following 
the flow v. In analogous way, we put 

Sv — T (gradv)x, (55) 

gradSv = grad <5v + {gradv) grad v. (56) 



* With the words “paragon setting” we refer to an ideal model of paragon for the material element 
and the body, say, e.g., an ideal crystal or any other choice that physical circumstances may suggest. 
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As an intermediate step we notice that 

<5 / e(g, v, gradv) d(vol) 

Js 

= / 8ed(vol) = / (2d g e ■ grad\ + d v e ■ 8v 

Js Js 

+ dgradv? ■ (grad <5v + (gradv)grad v)) d (vol). (57) 

By developing the valuation of (53), making use of (54)-(57) and Gauss theo- 
rem, we recognize that appropriate balances in the bulk are 



d u X — d x X + div( 2d g X — (gradv) T d gm dvX) = 0, 
d v X — d v X + div(d g ra d v X) = 0. 



(58) 

(59) 



Cauchy stress T is then given by 

T = —2d g X - (gradv) T S a , (60) 

where the actual microstress 3 a is defined by 



$a = -dgradvX. (61) 

In the case of simple bodies, (60) reduces to the well known Doyle-Ericksen 
formula. 



REMARK 7. A requirement of invariance of e under the action of SO (3) implies 
that 

skew(2d g X) = e(A J z a + (gradA T )‘ $ a ), (62) 

where z„ = —pd v e is the actual self-force and e Ricci’s alternating tensor. 



4.1. SPATIAL HAMILTON EQUATIONS 

To find appropriate spatial Hamilton equations, we follow the pattern of Section 1.1. 
To this end we define spatial canonical standard and substructural momenta (p and 
/2, respectively) through 

p = d u X, jl = d v X. (63) 

Consequently, the spatial Hamiltonian density is given by 

M(x, p, g, v, jl, gradv) = p • u + jl ■ u — X(x, u, g, v, v, gradv ) (64) 

(with some slight abuse of notation) and has partial derivatives with respect to its 
entries. By evaluating the variation of -7f. taking into account (54) and (56), and 
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comparing the result with the variation of X, after making use of the balances (58) 
and (59), we obtain the spatial form of the Hamilton equations: 



p = —d x J( + div(2d g J{ — (gradv) T d grac i v 3t), 
u = SpJf'; 

A = -d v M + div{d gradv M). 
u = djxJf. 



(65) 

( 66 ) 



4.2. SPATIAL HAMILTON-JACOBI FORM 

We may obtain the spatial counterpart of (49) by considering a smooth diffeomor- 
phisnr 

h: (x, p, g, v, A, gradv ) i — > (x*, p*, g*, v*, A*- gradv *), (67) 

which generates a new Hamiltonian density 

<74 (x*, p*, g*, v*, A*, gradv*). (68) 

Now, we may use a generating function S = S(t, x, p*, v, A*), and, following 
the same procedure of Section 3, we find that a necessary and sufficient condition 
to assure that -J( i: = 0 is 

d,S + Ji( x, 3 X S, g, v, 3 V S, gradv) = 0. (69) 



4.3. A SPATIAL FORM OF POISSON BRACKETS 

For the spatial Hamiltonian in equations (65), (66), taking into account (54)-(56), 
we define a new variational derivative dJ(/dx through the relation 

~8J( 

— — (x, p, v, A) • v = (— d x M + r/;V(23 g — (gradv ) T d gradv JX)) ■ v, (70) 

holding p fixed and allowing x to vary, for any v of the kind used in (54) — (56). 
Consider a boundary value problem of the type 

(23 g Jf - (gradv) T d gradv M)n = 3 x m ( x ), 

(dgradvdC ) n = 3 V u(v), on dS, 

(where u(x) and it(v ) arc the counterparts of the surface potentials U (x) and U (v)). 

The total Hamiltonian is now given by H (x, p, v, A) = / g M (with some slight 
abuse of notation) and we list only the entries (x, p, v, A) because we consider the 
variational derivative (65) below. We consider also arbitrary functionals F of the 
type f s f(x, p, v, A), with / a sufficiently smooth scalar density. 
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THEOREM 3. The canonical Hamilton equation 

F = {F, H} a (72) 

is equivalent to the Hamiltonian system of balance equations (65), (66) with 

f (If 8M IM 8f\ 

f ( Sf 8M 

+ Jas\&x <*P 
+ [(Sf'SJ i _8J i 8_f 
J s \8v 8 ii 8 ii 8v ' 

f (Sf 8M 

Jds\8v 8/a 



8M Sf 
8x <5p 



as 



8M 8f 
8v 8fi 



Id (area) 
as/ 



Id (area), 
as/ 



where {•, ■}„ is bilinear, skew-symmetric and satisfies Jacobi’s identity. 



(73) 



5. Final Remarks 

To illustrate possible uses of Theorem 2, we list below some special cases. Analo- 
gous results accrue from Theorem 3. 

REMARK 8. If we choose / = p • v, with v an arbitrary vector, equation (28 a ) 
and the boundary condition (31) follow immediately from (36). 

REMARK 9. Let / = fi ■ f u (v), then from (36) we get ( 29„ ) and the boundary 
condition (33). 

REMARK 10. Let / be of the form 

/ = p • (q x (x - x 0 )) + fi-Aq, (74) 

with q arbitrary as in previous sections. Consider also, for the sake of simplicity, ab- 
sence of external bulk interactions (the ones accounted for w(x, v)). By using (28) 
and (29), we obtain from (36) 

e(9 F JCF T ) = A T d v M + (VA T ) r 9 Vv ^. (75) 

These remarks are the Hamiltonian counterparts of Remarks 1-3. Of course, 
Poisson parentheses not only allow one to write in a concise form balance equa- 
tions, but generate articulated geometric structures over the infinite-dimensional 
manifold of mappings showing placements and order parameters, and properties of 
these structures depend also strictly on the geometric properties of M. 
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Abstract. When a body is subject to simple internal constraints, the deformation gradient must 
belong to a certain manifold. This is in contrast to the situation in the unconstrained case, where 
the deformation gradient is an element of the open subset of second-order tensors with positive 
determinant. Commonly, following Truesdell and Noll [1], modern treatments of constrained theories 
start with an a priori additive decomposition of the stress into reactive and active components with the 
reactive component assumed to be powerless in all motions that satisfy the constraints and the active 
component given by a constitutive equation. Here, we obtain this same decomposition automatically 
by making a purely geometrical and general direct sum decomposition of the space of all second- 
order tensors in terms of the normal and tangent spaces of the constraint manifold. As an example, 
our approach is used to recover the familiar theory of constrained hyperelasticity. 

Mathematics Subject Classifications (2000): 74A20, 74B20. 

Key words: continuum mechanics, internal constraints, constitutive theory, hyperelasticity. 



Dedicated to the memory of Clifford A. Truesdell 



1. Introduction 

Most contemporary works in constrained theories of continuum mechanics follow 
the approach of Truesdell and Noll [1],* wherein the stress is decomposed a priori 
into reactive and active terms with the reactive stress assumed to be powerless in 
all motions consistent with the constraints and the active stress given by a constitu- 
tive equation. The approach of Truesdell and Noll was motivated by the Ericksen 

* See Carlson and Tortorelli [2] for a fuller account of other work in this area. To that account the 
more recent work of Casey and Krishnaswamy [3, 4] must be added. As in earlier work of Casey [5] 
cited by Carlson and Tortorelli [2], these considerations are based on the behavior on the constraint 
manifold of associated unconstrained materials - an approach fundamentally different than ours. 
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and Rivlin [6] treatment of constrained hyperelasticity, which is based on the re- 
quirement that the constitutive equations for the stress and internal energy satisfy 
balance of energy in all motions consistent with the constraints. The main feature 
of the Ericksen-Rivlin hyperelastic development is that the stress is automatically 
decomposed into the sum of two terms. One term has zero power in any motion 
meeting the constraints and is determined by the constraints to within scalar multi- 
pliers; it is natural to think of this term as being present to maintain the constraints 
and to call it the reactive stress. The other term is, roughly speaking, the gradient 
of the internal-energy density with respect to the strain, and it is called the active 
stress. Carlson and Tortorelli [2] replaced the Lagrange multiplier formalism of the 
Ericksen-Rivlin approach with an elementary geometrical argument - essentially, 
the assertion that, if a vector a is orthogonal to every vector b that is orthogonal 
to some vector c. then a is parallel to c - used in the Truesdell-Noll method for 
determining the form of the reactive stress. 

It is widely accepted that many of the advances in modern continuum mechan- 
ics rest in large part on the clear separation of kinematics, basic laws of balance 
and growth, and constitutive equations that characterizes the subject. Where do 
internal constraints fit into this hierarchy? While internal constraints do delimit 
aspects of material response, they apply to broad classes of materials; for instance, 
the constraint of incompressibility applies equally well to both hyperelastic solids 
and viscous fluids. Hence, we view internal constraints as being more basic than 
constitutive equations.* It is natural then to attempt to ascertain the implications 
of the kinematical nature of internal constraints. Motivated by this point of view, 
Anderson, Carlson and Fried [8] used a modified version of the geometrical argu- 
ment of Carlson and Tortorelli [2] to deal with the constraints of incompressibility 
and microstructural inextensibility present in their theory of nematic elastomers. 
They started with a purely geometrical direct sum decomposition of the relevant 
fields based on the normal and tangent spaces of the constraint manifold to obtain 
automatically the decompositions of the deformational stress, orientational stress, 
and internal orientational body-force density into active and reactive components 
- without the use of any balance laws or constitutive assumptions. In this paper, 
we present this improved approach in the simpler context of isothermal continuum 
mechanics. We also take this opportunity to treat multiple constraints. 

In Section 2, we consider the case where the deformation gradient is restricted 
by n independent constraints. Thus, the deformation gradient is constrained to 
belong to a certain manifold in contrast to being an arbitrary element of the open 
subset of second-order tensors with positive determinant as in the unconstrained 
case. Next, we use the projection theorem to effect a unique orthogonal decompo- 
sition of the space of all second-order tensors in terms of the normal and tangent 
spaces of the constraint manifold. 



* O’Reilly and Srinivasa [7] take an analogous view in their treatment of constrained discrete 
mechanical systems. 
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In the absence of thermal contributions, the general thermomechanical princi- 
ples of energy balance and entropy growth combine to yield a free-energy inequal- 
ity, which may be simplified by means of the power identity. These considerations 
arc developed in Section 3. 

In Section 4, the orthogonal decomposition of Section 2 is applied to the stress 
tensor. We find that, for motions consistent with the constraints, the normal com- 
ponent is automatically powerless and only the tangential component enters into 
the free-energy inequality. Consequently, the tangential component is called the 
active stress, and one would expect to write a constitutive equation for it. On the 
other hand, the normal component, termed the reactive stress, is determined by the 
constraints to within scalar multipliers that we take to be constitutively indetermi- 
nate. Thus, our approach to internal constraints has the same level of generality as 
that of Truesdell and Noll [1] and provides exactly the same results. However, our 
decomposition of the stress, rather than being a priori , is dictated by the geometry 
of the constraint manifold. 

In Section 5, as an application of the general theory, we make elastic constitutive 
assumptions for the free energy and the stress and require that the free-energy 
inequality be satisfied for all motions consistent with the constraints to recover the 
theory of constrained hyperelasticity; and, in this sense, the present paper replaces 
the paper of Carlson and Tortorelli [2]. Finally, in Section 6, we show that when the 
principle of material frame-indifference is invoked in constrained hyperelasticity, 
the active and reactive stresses individually satisfy local balance of moment of 
momentum. 

Throughout, we use the notations of modern continuum mechanics; see, e.g., 
the text of Gurtin [9]. 

2, The Geometry of the Constraint Manifold 

We use a referential formulation. Accordingly, the body is identified with the region 
of space £ that it occupies in a fixed reference configuration. We write y for the 
motion of the body and 

F = Grady, (2.1) 

with det F > 0, for the deformation gradient. 

We consider the case where the motion of the body is restricted by n simple 
constraints', i.e., the deformation gradient is required to meet* 

Yi(F) = 0, i = 1 n, (2.2) 

where the constraint functions y, : Lin + — > R arc suitably smooth and independent 
in the sense that the set {Grady, -(F), i = 1 ,...,«} is linearly independent at each 

* At this level of generality, it must be required that n <9. However, once the principle of material 
frame-indifference is imposed (cf. the developments of Section 6), the constraint functions y,- are seen 
to depend on F only through the symmetric tensor F J F. Consequently, we must, in fact, have n < 6. 
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F belonging to Lin + . In other words, the deformation gradient must belong to the 
constraint manifold 

Con := {F e Lin+ : y t (F) = 0, i = 1, . . . ,n}. (2.3) 

Of great use to us will be the normal space to Con at F, 

Norm(F) := LspjGrad y,(F), i = 1 «}, (2.4) 

and its orthorgonal complement in Lin, 

(Norm (F)) x = {A e Lin : A B = 0, VF e Norm(F)} 

= {A € Lin : A -Grad y,(F) = 0, i — l, ... ,n] 

=: Tan (F), (2.5) 

which is the tangent space to Con at F . 

Of course, the constraint equations (2.2) must hold for all time, and time differ- 
entiation yields 

Grad p,(F)-F = 0, i = l,...,n, (2.6) 

which, in view of (2.5), is equivalent to 

F € Tan(F). (2.7) 

If the body actually occupies the reference configuration at some reference time (so 
that yi(I) = 0, i = I, . . . . n), then (2.6) implies (2.2) (see Carlson and Tortorelli 
[2]); hence, in this case, (2.7) is equivalent to (2.2). 

By the projection theorem, Lin admits the direct sum decomposition 

Lin = Norm(F) © Tan(F); (2.8) 

i.e., each A e Lin can be written uniquely as* 

A = A ± + A„ Aj_ € Norm (F), A, e Tan(F). (2.9) 

In view of (2.5), (2.7), and (2.9), 

A ± F — 0, A-F = A r F. (2.10) 



3. Free-energy Inequality 

We restrict attention to processes in which the temperature is independent of po- 
sition and time; in this case, the principles of energy balance and entropy growth 
(in the form of the Clausius-Duhem inequality), or the first and second laws of 

* Our usage of the subscripts ± and || here is exactly opposite to that used by Anderson, Carlson 
and Fried [8], 
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thermodynamics, combine to yield a free-energy inequality. On using tP to denote 
an arbitrary regular paid of 18 with boundary 3 IP and unit outw ard normal field n, 
this free-energy inequality requires that 



/ pi f + ^|ti| 2 J dv ^ / Sn-vda + / pb-vdv (3.1) 

J.p V / J 89 J.p 

for each instant and for all parts. Here, p is the referential mass density, v is the 
velocity field, f is the free energy per unit mass in the reference configuration, S 
is the first Piola-Kirchhoff stress tensor, b is the body force per unit mass in the 
reference configuration, and the superposed dot indicates time differentiation. 

Next, we recall that an easy consequence of the principles of mass balance and 
momentum balance is the power identity, which asserts that 



/ 5M-itdn+ / pb-vdv = / S-f’dit-f / \p\v\ 2 dv 

JdP J ,9 J ,9 J,9 

for each instant and all parts. Equations (3.1) and (3.2) imply that 



/ pf dv^ S F dv 

J ,9 J ,9 



for each instant and all parts. The local equivalent of (3.3) is 

pf ^ S F, 



(3.2) 



(3.3) 



(3.4) 



and it is this inequality on which our subsequent considerations of hyperelasticity 
are based. 



4 . Active and Reactive Stresses 

On employing the decomposition (2.10) in the particular case when A is identified 
with the first Piola-Kirchhoff stress S, it follows from the power identity (3.2) that 
only the component S B expends nonzero power over a constrained motion, and we 
refer to ,S' as the active component of the stress and write 

S, = S a . (4.1) 

On the other hand, the component S is powerless in a constrained motion, and we 
refer to S as the reactive component of the stress and write 

Sx = Sr. (4.2) 

Finally, since S r belongs to Norm(F), it follows from (2.4) that there exist scalar 
fields Xx, ... ,X n that we take to be constitutively indeterminate such that 

n 

S T = J] a, G rad y,(F). 

7 = 1 



(4.3) 
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Thus, we have shown that, when a body is internally constrained by simple con- 
straints of the form (2.2), the geometry of the constraint manifold dictates that 
the stress is automatically decomposed into the sum of two components: a pow- 
erless component 5 r that is determined to within scalar multipliers by (4.3); and 
a component S a that does expend power and consequently appeal's in the free- 
energy inequality. We emphasize that this result is independent of any constitutive 
considerations other than the “simple” nature of the constraints; in particular, the 
body need not be elastic. 

A noteworthy feature of our approach is that, in view of (4.1), (4.2), and (2.9), 

S a -S Y = 0. (4.4) 

This automatic normalization is important, because the presence of the constitu- 
tively indeterminate multipliers in S r (see (4.3)) means that the response function 
for any component of S a not orthogonal to S r could not be measured. 

5, Constrained Hyperelasticity 

In the constrained case, it follows from (2.10) and (4.1) that the local free-energy 
inequality (3.4) reduces to 

pi> ^ S a F. (5.1) 

For hyperelasticity, we make the constitutive assumptions that 

xjr = jr(F), xjr. Con -> M, (5.2) 

and 

S a = S a (F), S a : Con Tan(F) 

Now, with xj/ assumed to be smooth, 
xjr = Grad,i fr(F)-F; 

so the local free-energy inequality becomes 

(S a (F) - pGrad ll f(F))-F ^ 0. (5.5) 

In the spirit of Green [10, 11], Ericksen and Rivlin [6], and Coleman and Noll 
[12], we require that our constitutive equations be restricted such that the local 
free-energy inequality (5.5) is always satisfied. To make this precise, we say that a 
constrained hyperelastic process consists of: 

(i) a motion y consistent with the constraint equations (2.2); 

(ii) scalar fields ki, . . . , k„; 

(iii) a free-energy field xjr given in terms of y by constitutive equation (5.2); 

(iv) an active stress field S a given in terms of y by constitutive equation (5.3); 



(5.3) 

(5.4) 




GEOMETRICALLY-BASED CONSEQUENCES OF INTERNAL CONSTRAINTS 



147 



(v) a reactive stress field S r given in terms of y and Xi, ... ,X n through (4.3); and 

(vi) a body force field b determined in terms of the above fields through local 
balance of momentum. 

Then, we insist that the local free-energy inequality (5.5) be satisfied for every 
constrained hyperelastic process. At least locally, it is possible to choose a con- 
strained hyperelastic process such that, at any given position and time, F and F 
take on arbitrary values in Con and Tan(F), respectively. Since both SJF) and 
Grad || i Jr(F) belong to Tan(F), we conclude that 

S a (F) = pGrad^(F). (5.6) 

In (5.4)-(5.6), Grad n x[r(F) represents the tangential gradient of i ]/ at F. When 
the response function r/V admits a smooth extension off the constraint manifold to 
an open subset of Lin + , then 

Grad„i }(F) = ^1 - ^ tV,-®A^Grad \jr(F), (5.7) 

where the fourth-order tensor I is the identity operator on Lin, {A',-, i = 1 ,...,«} 
is an orthonormal basis for the linear subspace Norm ( F ), and A®B is the fourth- 
order tensor defined such that ( A®B)C = ( B C)A for any second-order ten- 
sor C. 



6. Material Frame-indifference and Moment-of-momentum Balance 

An interesting feature of hyperelasticity in the unconstrained case is that the prin- 
ciple of balance of moment-of-momentum need not be taken as an axiom; rather it 
appeal's as a theorem in the theory primarily as a consequence of the principle of 
material frame-indifference. In this section, we show that this is the case also in the 
constrained theory as developed above. 

As noted in the introduction, internal constraints do delimit aspects of material 
response. Thus, the kinematical restrictions embodied in (2.2) are subject to the 
principle of material frame-indifference: 

Yi(QF) = Yj(F ), i = 1, . . . , n, V(0, F) e Orth + x Lin + . (6.1) 

A standard consequence of (6. 1) is that for each i 

9i(F) = ft (C) , Yi : Psy m R, (6.2) 

where 

C = F T F (6.3) 

is the right Cauchy-Green deformation tensor. 
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By (6.2) and (6.3), 

Grad p, (F) = 2FGrad y, (C ) , (6.4) 

and (4.3) becomes 



S r = h FGrad Yi (C) (6.5) 

(=i 

in terms of the reduced constraint functions y,-, where the factor of 2 has been 
absorbed into the constitutively indeterminate multipliers. An immediate conse- 
quence of (6.5) is that 

S y F t =FS r T , (6.6) 

which is the local form of balance of moment-of-momentum for the reactive stress. 

Similarly, material frame-indifference requires that the constitutive equation 
(5.2) for the free-energy density reduce to 

t = $(C). (6.7) 

Here, of course, the domain of \j/ is the reduced constraint manifold 

Con(C) := [C e Psym : y,(C) —0,i = 1 (6.8) 

In terms of ip, (5.6) becomes 

s a = S a (C) = 2pFGrad„ip(C), (6.9) 

where Grad „ now denotes the tangential gradient with respect to the manifold Con. 
Furthermore, it follows from (6.9) that 

S. d F T =FSl, (6.10) 

which is the local form of moment-of-momentum balance for the active stress. 
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1. Introduction 



In the analysis of mechanics, need often arises to determine whether certain quad- 
ratic integral inequalities hold. For example, in a recent work, Chen and 
Haughton [1] study the stability of inflation and stretch of thick-walled incom- 
pressible elastic cylinders. An energy stability criterion is used, which leads to a 
variation problem. The second variation condition reads 




Vu • Wf F [Vu] + Wp • V (VuF _1 u)} dV ^ 0, 



( 1 ) 



where Q is the cylindrical reference configuration, F the deformation gradient, 
VT (F) the strain energy function, and u the variation function. By incompressibility 
and a Fourier analysis, inequality (1) is found to be equivalent to the following 
quadratic integral inequality 

b 

U(r) • A(r)U(r) dr ^ 0, 



where r is the radial coordinate, a and b the inner and outer radii, respectively, 
of the cylinder, A(r) a 3 x 3 symmetric matrix whose components are smooth 
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functions of r, and U(r) = (u(r), u'(r), u"(r)) T is a column vector consisting of 
an arbitrary smooth scalar function u(r) and its first and second order derivatives. 
The above inequality was solved in [1] and is a special case of the problem studied 
here. In this work, we develop a general method to determine the positivity of 
quadratic integrals involving derivatives of arbitrary order. 

Let [a, b ] be an interval in R, and n be a positive integer. We define the space 
of admissible functions V by 

U = C n ([a,b];] R). 

Consider the following quadratic integral inequality 




a,j(x) 



d'«(x) dhi(x) 
dx' dxJ 



dx ^ 0, 



( 2 ) 



where a i; - e C"([a, b\, R), i, j = 0, 1, ... , n, satisfy 



a ij = a j j . (3) 

The objective of this work is to determined whether inequality (2) holds for all 
u £ U. 

It is noted that (2) is the second variation condition for minimizing an integral 
involving higher order derivatives of the competing function. The case where n = 1 
has been treated by many authors, and has become a paid of classical theory of 
calculus of variations. See, for example, Hestenes [2] and Sagan [3]. The focus of 
this work is on the cases where (2) contains derivatives of u(x) of any order. While 
the conditions to be derived in this paper pertain to (2) as it stands, they can be 
made appropriate for the strict inequality version of (2). 

It is also noted that (2) can be recast in a form which is related to a constrained 
minimization problem studied in control theory. Indeed, by writing w,(x) = 
d' u (x) /dx' , the left-hand side of (2) can be rewritten as 

/ b n 

Y ajj ( x)uj ( x)uj (x) dx. (4) 

ij = 0 



A problem in control theory is to find u„(x) that minimizes (4) subject to the 
differential equation constraints 

dn,(x) 

= M, + i(x), i = 0, 1 , . . . , n — 1 . 

dx 

Variations of this latter problem, often for the constant coefficients a, ; with a,„ = 
0, i — 0, . . . , n — 1, have been studied in control theory (for example, see [4-6]), 
where u n (x) is called control function. An important development is the associated 
matrix Riccati equation whose solution determines the optimal control function. 
The solution of the Riccati equation has been studied extensively [7-10]. 
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For unconstrained inequality (2) itself, some sufficient conditions and necessary 
conditions can be derived by elementary arguments. Let A (x ) be the (n + 1 ) x (« + 1) 
matrix function with cijj (x) being its elements. An obvious sufficient condition 
for (2) to hold for all u G U is that A(x) be pointwise positive semi-definite, that 

is, 

v • A(x)v ^ 0 Vve M" +1 , x e [a, b]. (5) 



This condition, although very simple, is in most cases too strong to be practically 
useful. 

Condition (5) is a pointwise algebraic inequality. A condition of this kind is 
desirable as it usually allows a simple verification. However, a necessary and suffi- 
cient condition for (2) to hold is in general not pointwise. 

When A is a constant matrix, one can derive various algebraic necessary con- 
ditions by taking special form of u (x) in (2) and carrying out the integration. For 
example, by choosing u (x) = e kx , k e R, we find that (2) implies that 



v • Av ^ 0 



Vv = 



/ 1 \ 

k 

k 2 



\ k " / 



This scheme, however, may not lead to useful result when A is not constant. Even 
when A is constant, the necessary conditions obtained this way may well be much 
weaker than (2) itself. 

The difficulty in finding necessary and sufficient conditions lies in the fact 
that V. is an infinite-dimensional function space. This problem is dealt with in 
the present work by relating the integral in (2) to a system of nonlinear ordinary 
differential equations, which is identified as the Riccati equation in control theory. 
The analysis presented here provides the workers in mechanics with a direct access 
to the solution of (2) without referring to the constrained minimization problem 
studied in control theory, and without the usual restrictions on cijj, which arc par- 
ticular in control theory. In addition, we present a detailed treatment of the case 
where the solution of the Riccati equation becomes unbounded. 

Besides its theoretical value, the method presented in this work offers great 
numerical advantage in dealing with the integral inequality (2). As no numerical 
method has been developed to solve such an inequality, numerous ODE solvers arc 
available. Once the coefficients a ;/ arc given, the corresponding Riccati equation 
can always be solved numerically, and the properties of the solution provide a 
definite conclusion on whether (2) holds for all u e U, as demonstrated in [1] 
where the stability of inflation of elastic cylinders has been determined for the first 
time. 

In the next section, we state some well-known results in calculus of variations 
regarding the integral in (2). Section 3 introduces the system of differential equa- 
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tions upon which the solution method is based. In Section 4, a necessary and 
sufficient condition is derived when the solution of the differential equations is 
bounded. In Section 5, the case where the solution is unbounded is analyzed. An 
illustrative example is given in the concluding Section 6. 



2. Preliminaries 



The quadratic inequality (2), being the second variation condition of some mini- 
mization problem, forms a minimization problem itself. The first variation condi- 
tion of this minimization problem reads 




d'u(x) d'v(x) 
dx' dx 7 



dx = 0. 



(6) 



If «(x) is a minimizing function of the integral in (2), it must satisfy (6) for any 
v(x) in a class of variation functions. If both u (x) and v(x) arc smooth (say, of 
class C 2 "), one can integrate (6) by parts n times to obtain 



r b « 

Ll 



E d 'u d J v 
a i i — — t — — dx 
1 dx 1 dx J 

‘.j = o 



d k ~ l / d'u 

k= 1 j=k i=0 x 

f b . d J ( d'u \ 

+ l 



d'u\ d ' k v 



dx J 



-k 



i,j = 0 

n — 1 n—l n 



:c-iy 



; d J / d'u 
dxJ \ lj dx' 



+ 



= 0. 



d k - ] / cYu \ d'u 

1=0 k= 1 1=0 x 7 

/ b " 

E 1 

i.j = 0 

" " c\ k / H' 7 / N 

EED-»^ 

/=i *=o £=0 x 7 

r h A , d J / d' « 

l £ <_ srs?, 



i; dx 



d'nXd'- 1 !; 



Here a change of variables j — k + 1 for the summation indices has been used. 

By a standard argument [2, 3] in calculus of variations, one can derive the fol- 
lowing Euler-Lagrange equation which must be satisfied by a minimizing function 
n(x) at the points where n(x) is of C 2 ": 



n 



E<-‘) 



dx> 



( d'u 

V !/ dri 



= 0 . 



(V) 




SECOND VARIATION CONDITION AND QUADRATIC INTEGRAL INEQUALITIES 



155 



Furthermore, at a point where u(x) is not of C 2n , the Weierstrass-Erdmann corner 
condition must hold, which states that 



n—l n 



ED- u‘ 



d* 

dx k 



di,k+l 



d' li 
dx' 



l = 1 



must be continuous. 

A well-known pointwise necessary condition for (2) to hold is the Legendre 
condition* 



a nn 0)^0 Vx e \a, b\. (8) 

Indeed, if a nn (x o) < 0 for some xo e ( a,b ), one can construct an admissible 
function u(x) that has a non-empty support contained in a neighborhood of xo, and 
that is so oscillatory that (2) is violated for this w(x). The conclusion that (8) must 
hold at the end points follows from the continuity of a nn . In this work, we assume 
that the following strengthened Legendre condition holds: 

a nn (x) >0 Vx e \a, b\. (9) 



3. A System of Differential Equations 

Of central importance to the solution of (2) is the solution of a system of first order 
differential equations. Via this system of equations, the left-hand side of (2) can 
be written in a form whose positiveness can be determined unambiguously by the 
solution of the system. 

To be determined is an n x n matrix function Y(x), whose elements v,- / (x), 
i, j = I , satisfy the following system of ordinary differential equations and 
initial conditions 

d Xij (tlj — | n yin')(.@j—l,n V jn) 

= (ii ij i - ytj - 1 - V/-1 

i, j = 1, 

}'ij (a) = 0 . 

The notation in (10) is so chosen that 

y <0 = yo< = o, i = l,...,n. (12) 

It follows from (3) that Y is symmetric: 

yij=yji ■ ( 13 ) 



( 10 ) 

(ii) 



The boundary value problem (10, 11) for a system of differential equations is 
introduced to solve the integral inequality (2). Through this system of differential 

* The classical Legendre condition pertains to (2) for n — 1 . 
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equations, the left-hand side of (2) can be integrated by parts in such a way that 
the resulting integrand is the square of a function which can be made to vanish for 
a certain choice of the admissible function. Whether (2) holds is then determined 
by the boundary terms, which arc further related to the solutions of this boundary 
value problem. 

The system of differential equations (10) is identified as the matrix Riccati 
equation in control theory where the coefficients a;j arc often taken to be constant 
with o,„ = 0, i = 0, — 1. The solution of the Riccati equation has been 

widely studied [7-10]. 

The general theory for the solution of the initial value problem (10) and (11) 
is well developed. See, for example, Ince [11]. It can be shown that under condi- 
tion (9), a Lipschitz condition is satisfied in a neighborhood of the initial point. 
A unique continuous solution of (10) and (11) then exists in the neighborhood. 
This neighborhood, however, may or may not cover the entire interval \a. b\. The 
solution may become unbounded as x approaches some c f b. It will be shown 
below that in this latter case inequality (2) is violated for some admissible function. 

The system of differential equations (10) is nonlinear. It is known in control the- 
ory, as shown below, that this system can be related to a 2/rth order I i near ordinary 
differential equation, which is identified as a generalization of the classical Jacobi 
equation. The behavior of the solution to this equation is well understood. It is 
found that if the solution of (10) and (11) becomes unbounded at a point in ( a , b\, 
then the 2/ith order equation has a solution which and its first n — 1 derivatives 
vanish at the point. 

In the next section, we first consider the case where the solution of the initial 
value problem is bounded in \a. b\. 



4. Bounded Solution 

When the solution of (10) and (11) is bounded on [a. b], whether (2) holds for all 
u e Vi is determined completely by the values of the solution and its derivatives 
at b. This will be proved by using the following lemma, which will also be utilized 
for further development. 



LEMMA 1. If the initial value problem (10) and (11) has a solution Y (x) that is 
of C 1 on [ a , c ] for some c G ( a , b], then for any u e U the following equality holds 



n— 1 



f c v-^ d 'u d J u d'u d'u 

I h a ^^j<rc=hy< + u t+ Mj-(c)j-(c) 

i.j = 0 1,7=0 



dx' dxJ 

n— 1 



_ [ C I d^M Oin ~ yi+l,n d'll 

! a Clnn \dA" ^ a nn dx' 



dx. (14) 



2 
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Proof. Let the solution y ;/ - (x) with the required properties be given. By (3), (12), 
(10), (1 1) and (9), we find that 



f c A d'u d j u 



n-1 j i j; n— 1 
d u d J u 



f( 
/•lg[( 



E cr u a J u d 'u d "u d "u d "u \ 

an — — + > 2a in — - h a nn — dx 

dx' dx' ' dx' dx" > 



i,j = o 

n — 1 

E 

L;=o L 



i=0 



a ij ~ yi+hj ~ 3T./+1 



dx" dx" , 

(a,„ - y !+ i,„)(a 7 „ - y ;+ i,„)^d'n d'u 

dx' dx' 



d'u d J+] u 
+ 2 yi+i.j+\ f — dx./+i 



&nn 
n— 1 



d”ii / \cTwcTw 

+Clnn d. * d^ + ^ 2 \ ain ~ yi+Un )d^d^ 

i = 0 v 7 



n— 1 



(a,„ - ;y;+i,„)(a/„ - d'w d'u 1 ^ 



f c / 

- E 

Ja Li, ;=0 ' 



+ E 

• • r\ M ' fit 

1 >J= o 

” _1 'd y (+ !,;+! d'n d'w 



dx' dx' 



dx 



d' n d' +1 u\ 

d7 dI7 + 2 ' v ' l - / ' l d7 d^r J 



+ a n 



n— 1 



d n At 1 j i \ 2 

u ^ a,„ - y 1+1 ,„ d'n 



dx' 



+ E 

i=0 



dx' 



dx 



r\ v— l d / d'u d'u\ / d"n A a in — y, +1 „ d'u 

j, [E + 



n — 1 



«=0 
n— 1 



E , , d'u d'u , f c ( d” tt v-^ cii n ~ yi+i.ti d'u 

.y, + u + , W j-fri + o™ (j^ + E dF 

i,j=0 a v i=0 



dx 



dx. 



□ 



Lemma 1 and inequality (9) readily render the following sufficient condition 
for (2) to hold. 

THEOREM 1. If the initial value problem (10) and (11) has a solution Y(x) that 
is of C 1 on [a, b\, and if the matrix Y (b) is positive semi-definite, then inequal- 
ity (2) holds for all u e U. 

In Theorem 1, the condition that a C 1 solution of (10) and (1 1) exists on \a. b] 
is crucial. For a particular problem, this condition may not be satisfied, as demon- 
strated by the example in Section 6. This case will be treated in the next section. 
In the next theorem, we shall show that when the above-mentioned condition is 
satisfied, the remaining conditions in Theorem 1 are actually necessary for (2) to 
hold. 
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THEOREM 2. If the initial value problem (10) and (11) has a solution Y(x) 
that is of C 1 on [a, b\, and if the matrix Y (b) is not positive semi -definite, then 
inequality (2) does not hold for some u e VL. 

Proof By the given conditions, there arc v, € M, / = such that 



X yjj(b)VjVj < 0. 

U = 1 

Consider the following initial value problem for u (x) : 

d n u a in - y i+h „ d'u Q 

dx" ^ a nn dx' 

i=0 



(15) 



(16) 



d'u 

— (b) = v i+ 1 , i - 0, 1, . . . , n - 1. 
dx' 



(17) 



By the theory of linear ordinary differential equations, this initial value problem 
has a unique solution u(x) on [a, b\. For this «(x), we have, with the help of (14), 

f b d'u d'u d'u d'u 



i,j = 0 



0=0 



dx 1 dx 

n— 1 



r d n u 

+ L ‘ , "T^ + S 



a in ~ }'i+\,n d'u 

— dx 

&nn QX l i 



n— 1 



= XI yi+hj+l(b)v i+ iV j+1 
0=0 

< 0. 



The last two steps follow from (17), (16) and (15). 



□ 



5, Unbounded Solution 

In this section we consider the case where the solution of the initial value prob- 
lem (10) and (11) becomes unbounded as x approaches some c e (a, b\. The- 
orems 1 and 2 arc in this case inapplicable. We shall show that it is possible to 
construct a function m e K for which inequality (2) is violated. To this end, we 
first relate the system of the first order nonlinear differential equations (10) to a 
sequence of linear differential equations. 

Let the solution Y(x) of (10) and (1 1) be given, that is of C 1 in [a, c). Consider 
the following nth order linear differential equation for n: 

All n 1 A i 

d n u a in - y i+ hn d'u 

dx" a dx' 

i=0 



( 18 ) 
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It is noted that equation (18) is identical with (16), and that a solution u(x) of (18) 
makes the last integral in (14) vanish. By the theory of linear ordinary differential 
equations, all solutions of (18) form an //-dimensional I i near space. In this work, 
we shall assume that these solutions arc of class C 2 " . 

LEMMA 2. A solution u(x) of ( 18) satisfies 

d / X d'u\ X d'u X d'u 

dllX*+u d 7 ) = X a o-i 77 - 2 ^yw,H 77 * 

j = (19) 



Proof By using (10), (18), (12), (13) and (3), we find that 



d / X d' u 

7r l X- v ' +1 4 77 



= J2 yi + i j 



+ X - 



ywj - 1 _ ytj 



(a i n yi+l,n')(.ttj—l,n 



A d' u X' d' u d" u 

X y‘j 77 + X^o- 1 - ^+i,y-i - ytj) T- + (oy-Rn - y>) — 



^ dx ^ 

i=l i=0 



^ d'w 

= X^O' -1 ~~ Ti+W-i) 777 + “/-bn 737 



A d'u U d'u 

X A” 1 T-7 - X-^+hf- 1 HW' 



PROPOSITION 1. A solution u(x) of (18) satisfies 

X d' /A d'u\ A d ' u 

X ( “ 1} ^7 (X «'>-/+;• ^7 1 -Xw+W ^7 = °’ 

7=0 \ i=0 / i=0 

l = 0 , 



Proof Equation (18) can be rewritten as 



A d'n A d'u 

X a ‘ n 77 ~ X ;v ''+ 1 ’' , ha _ °- 



^ - dM ^ 

i=0 i=0 



(21) 
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Taking the derivative of (21) / times and using (19) repeatedly, we find that 

d 1 / ^ d' u d' u \ 

HA 1 ^ Uin ~> ~ 2^ yi+1 ’ n fai ) 



dr / \ z — 1 ' '"dx 

\ i = 0 i=0 



n — 1 



d’ ( d' U \ d 1 1 ( -fi-y d' u ^-7 d' u 

= ) ~~ jfn \ - 2^y‘+hn-i-^— 

1=0 ■ ) 



! ^-'m=o 



d’ /A d'n \ d'- 1 AA d'V 



dx ( ' dx' / dx ,_l 

\ 1=0 / \ 1=0 



+ 



T-2 



d.x ,_2 



«— 1 



r n 1, At— i J, > 

E d' u d' u 

a ‘' n ~ 2 d7 - l^ yi+l ' n - 2 ~ 



\ 1=0 



i=0 



dx' 



/- / d 7 / d ' m ' 

= X (_1) T“7 (S' 777 



i=2 



v i=0 






n— 1 



dx 



V 1=0 



dx' 



yi+l,n-;+l 



i=0 



d'n 

dx' 



/- ,■ d 7 / d' n ' 

= X (_1) 771 X^-^TTT 



j = i 



v i=0 



«— 1 



, n/ /f d'n ^ d'n 

+ ( 1 ) 1 ^ «,Mi-Z • X^ 1 -""'^ 

\ 1=0 1=0 



i-i d 7 ( •st' d'n\ i+iv-' 

X* -1 ) ' ^7 ( X a o«-i+j ~j^7 ) + (_1) X^-'-' 



j = o 

= 0 . 



v i = 0 



i=0 



d'n 

dx' 



□ 



An important consequence of Proposition 1 , obtained by taking l — n in equa- 
tion (20) and using (12), is 



COROLLARY 1. A solution n(x) of (18) satisfies 



A d Jf d' n \ 

^ ( - 1,J d77(“«SF )= a 

i.;=o v 7 



( 22 ) 



Equation (22) is found to be a generalization of the classical Jacobi equation, 
which was originally derived for the variational problem involving the first order 
derivative of the admissible function. It is also noted that (22) is identical with the 
Euler-Lagrange equation (7) since the functional in (2) is quadratic. The theory for 
such an equation is well developed. Among other things, since n ;/ e C" ([ a , b ], M), 
all solutions of (22), and therefore all solutions of (18), arc bounded on [a, b\. 
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PROPOSITION 2. If the solution Y(x) of { 10) and (11) is of C 1 in [a, c ), and is 
unbounded at x = c, then there exists a nontrivial solution n(x) of ( 18) on [a, c ], 
that satisfies 

d ‘u 

— r(c) = 0, i = 0, 1, ...,n - 1. (23) 

dx 1 



Proof The first n equations of (20) can be rewritten as a vector equation 

Y(x)w(x) = g(x), (24) 



where the n x n matrix function Y(x) is the solution of (10) and (11) under consid- 
eration, and w(x) and g(x) are n -dimensional vector functions whose components 
arc given, respectively, by 



wt(x) = 



d'~ l u(x) 

dx' -1 

n—i 



i = 1, . . . , n, 



j = 0 



'^2a k j + j(x) 

_k = 0 



d k u(x) 
dx k 



i = 1, ... ,n. 



(25) 

(26) 



Equation (18) has n linearly independent solutions u (1) (x), u (2 \x), ... , u tn> (x) on 
[a, c). By Proposition 1, these solutions also satisfy (20), and therefore (24). Let 
w (1) (x), w <2) (x), w (w) (x) and g (1) (x), g (2) (x), . . . , g (,i) (x) be the correspond- 

ing vector functions defined through (25) and (26), respectively. Then each pair of 
functions w (/ Tx) and g (/) (x), / = l, ... ,n satisfy (24). Since Y(x) is unbounded 
at x = c, at least one eigenvalue of Y(x) is unbounded at x = c. It then follows 
from (24) that the vectors w (/) (c) , / = \ ..... n arc all orthogonal to the associ- 
ated eigenvector, and therefore are I i near dependent. Hence, there exist constants 
c\, C 2 , . . . , c n , not all zero, such that 



n 

£>w (/) (c) = 0. 

;=i 

Now define 

n 

u(x) = Y.ciu«\x). (27) 

i=i 

This function satisfies (18) and (23). Moreover, n(x) is nontrivial since functions 
m (/) , / = 1, . . . , n are linearly independent on \a. c). □ 



It is noted that Proposition 2 is related to the notion of conjugate point in the 
classical theory of calculus of variations [2, 3]. For the case n = 1, the point x = c 
is said to be conjugate to x = a if the Jacobi equation has a nontrivial solution that 
vanishes at these two points. 
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PROPOSITION 3. If the solution Y(x) of (10) and (11) is of C 1 in [a, c ), and is 
unbounded at x = c, then there exists a nontrivial solution n(x) of ( 18) on [a, c ], 
such that 




d'u c Vu , 

an r — r dx = 0. 

1 dx' d XJ 



Proof Let u(x) be given by (27). By (14), (18), (25), and (24), we find that 




d'u(x) d'u(x) 

— T“ r— d ' Y 

dx' dx' 



n— 1 

= dm V y,+i, ;+1 (x) 

x-*c 

i.j = 0 



d'n(x) d'w(x) 
dx' dx' 



i 



+ / (x) 



n— 1 



d"w(x) , ^ a,„(x) - y (+ i > „(x) d'n(x) 



“i2 



dx" 



+ E 



n— 1 



lim E -T+l.j+lW 

r— J 



0=0 

= lim [Y(x)w(x)l • w(x) 
= g(c) • w(c), 



1=0 

d' « (x ) d j u(x) 
dx' dx' 



"///'/ (a) 



dx' 



dx 



where w(x) and g(x) are defined by (25) and (26) through u(x). The desired 
conclusion then follows from Proposition 2. □ 

We are now in a position to prove the main result of this section. 

THEOREM 3. If the solution Y(x) of (10) and (11) is of C 1 in [a, c), and is 
unbounded at x = c for some c < b, then inequality (2) does not hold for some 
u G U. 

Proof Let u(x) be given by (27). Define 

_ . . [ u (x) for a ^ x ^ c, 

u(x) = { 

l 0 for c < x ^ b. 



It follows from Proposition 3 that 
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Suppose that (2) holds for all u e Vi. Then we would have 




d 'u d J u 
an — - — r dx 

1 dx' dxJ 



< 



f b d‘ a d J u 

/ / an — - — r dx 

Ja ,“o d ' Y ' ^ 



V« G U. 



This implies that u is a minimizing function of the quadratic integral in (2), and 
therefore must satisfy, in addition to the Euler-Lagrange equation (7), the 
Weierstrass-Erdmann corner condition. This latter condition asserts that the ex- 
pressions 



n—k n 



ED-i) 





k = 1 ,...,« 



must be continuous at x = c. By the definition of w(x), we would then have 



n—k n 

EE<- 1)J 

7=0 i = 0 



d' 



d'u 



dx' \“ i j ' k dx' 



= 0, k = l, ... ,n. 



(28) 



By Corollary 1, function u is a solution of the 2/ith order linear differential equa- 
tion (22). The initial conditions (23) and (28) then imply that w(x) is identically 
zero on [a, c], which is a contradiction. □ 



To complete the solution of the integral inequality (2), it remains to analyze the 
case where the solution Y(x) of (10) and (1 1) is of C 1 in [a, b), and is unbounded 
at the end point x = b. We shall show, in the following theorem, that a function 
u G U can be constructed for which (2) is violated. 

THEOREM 4. If the solution Y(x) of (10) and (11) is of C 1 in [ a,b ), and is 
unbounded at x = b, then inequality (2) does not hold for some u e U. 

Proof. Let w(x) be given by (27) in Proposition 2, with c therein being replaced 
by b. By Propositions 1 and 2, and Corollary 1, this u satisfies (18), (20), (22) 

and (23), again with c being replace by b. Define functions gj (x), i = 1 n, 

through (26) and u(x). The values gfb), i = \ ..... n, cannot be all zero, because 
otherwise i5(x) would be identically zero, in virtue of (22) and (23). Now define 

n ~i * 

s w s 

1=0 1 ■ 

and 



n(x) = n(x) — eg(x), (29) 

where r is a small positive number. Obviously u € U and 

^(b) = g i+1 (b), i — 0, 1, ... ,n — 1. 
dx' 



(30) 
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Furthermore, by (14), (29), (18), (20), (26), (23) and (30), we find that 

"b n Ai,. rb " 



r v- a 'u ah , , r ■F d '« d '« 



= lim 

x^-b 



dx 



n — 1 



4-A d'n d 7 w f x ( d"u 4-4 a in 

. 23 «+■•/+■ s? dtr + 1 a - ( 4? + 23 - 

U,./=o \ i=o 

" ft — 1 

? I 

e lim 

x^-b 



yi+l,n d l U 



dx' 



2 -i 

| dx 



d 'S d J g 

2^ ywj + 1 dA , dv i 



L/,7=0 

C x 

"F / I 



«— l 



d”g + a in ~ yi+ i.n d'n 



dx 



i=0 



dx' 



dx 



/ 4-4 d'n d'n - 4—1 d'g d 7 g N 

- v ™ ( - V,+1J+1 d 7 - e 2^ - v '+ 1 ’ 7 + 1 

V,7=0 i,j = 0 ) 



n— 1 



^ d'n /d'n d'g 

= hmy]y, t , J+1 — — -2 t — 



1,7=0 

n— 1 n—j—i 



= limV V (-l)*^-( y« ij+{+ i- U— -2^ 

i->s 4-^ 4-^ dr* \ 4— «■ dx' / V dx ' dx ' 



7=0 k = 0 

ft— 1 



7=0 



o '=0 



d'n . d'g 



limV^ +1 — -2c^ 

.t^T) 5 ,+ V dx' dx' 



dx' 



dx 7 



n— 1 



= ^| ;+1 (&)[-2 e g ;+1 (&)] 

7=0 

= -2c|g((7)| 2 . 

Since g(fe) is nonzero, the desired conclusion follows for sufficiently small e. □ 



In summary. Theorems 1, 2, 3 and 4 provide a definite conclusion on the pos- 
itivity of the quadratic integral in (2). The conclusion hinges upon the solution 
Y(x) of the initial value problem (10) and (11). For a practical problem, once the 
coefficients a d (x) are given, this initial value problem can be solved numerically. 
If the solution Y(x) becomes unbounded at some c ^ b, the inequality (2) does 
not hold for some u e V. On the other hand, if the solution is bounded on [a, b], 
whether (2) holds for all u e V depends on whether Y (x) is positive semi-definite 
at x — b. 




SECOND VARIATION CONDITION AND QUADRATIC INTEGRAL INEQUALITIES 



165 



6. An Example 

As an illustrative example, we consider the inequality 

1 

\k 4 u 2 + 2k 4 xuu' + k 2 ( 1 — kx) 2 u' 2 + 6 k 2 xu'u" + w" 2 ] dx ^ 0, (31) 

where k is a constant, and a prime denotes the derivative with respect to x. This is 
inequality (2) with 

n = 2, a = 0, b — 1, a 00 = k 4 , an = k 2 ( 1 — kx) 2 , 

U22 = 1, Cloi = k 4 X, ClQ 2 = 0, a 12 = 3k 2 x. 



Not only do the theorems in the previous sections decide whether (31) holds for all 
»eK, the proofs of the theorems also provide the details of direct justification of 
the conclusion as shown below. 

The initial value problem (10) and (11) takes the form 

' y'u = k4 - yn> 

>’22 = k 2 ( 1 - kx) 2 - 2y u - (3k 2 x - y 2 2 )~ , 
y'u = k 4 x - yn + yu(3k 2 x - y 22 ), 

. >’| | (0) = >’22(0) = >i 2 (0) = 0. 

The solution is found to be 

, k 2 x( 1 — 2 kx) 

>n = k 2 x, >22 = 7 , >12 = 0. (32) 

1 — kx 



The solution is bounded in [0, 1] if k e (— oo, 1). In this case, we have 



Y(l) = 



k 4 0 

0 k 2 (l-2k)/(l-k) 



By Theorems 1 and 2, inequality (31) holds when k e (— oo, 1/2], and does not 
hold when k € (1/2, 1). Indeed, when k e (—oo, 1/2], we have, by integrating by 
parts, that 



f [ k 4 u 2 + 2 k 4 xuu' + k 2 ( 1 — kx) 2 u' 2 + 6 k 2 xu'u" + w" 2 ] dx 

Jo 



, 9 k 2 — 2 k 3 n 
= k m“(1) H — u “(1) + 



1 - k 



L ( u " + 



2 k 2 x - k 3 x 2 
1 — kx 



u' ) dx 



> 0. 



On the other hand, when k e (1/2, 1), we choose 
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and arrive at 

• i 



f [ k 4 u 2 + 2 k A xuu + k 2 ( 1 — kx) 2 u' 2 + 6 k 2 xu'u" + u" 2 \ dx 

Jo 



Jo 



k 4 (x — 3 kx 2 + 2k 2 x 3 ) exp 2 ^-^: 2 — k + kx — -k 2 x 2 

= k 4 (l - jfc)(l - 2k) 

< 0. 



Finally, when k e [1, oo), the solution (32) is unbounded at x = I / k. Theo- 
rems 3 and 4 assert that inequality (31) does not hold for some u e V. Here we 
demonstrate it for k = 1. Taking in the left-hand side of (31), 



w = — \/e(4 — 3x) + exp^x — -j 



we find that 

-l 



/ 



[m 2 + 2xmm' + (1 — x) 2 // 2 + 6xu'u" + m" 2 ] dx 
- J e(25 — 66x + 36x 2 ) — \/e(2 + 14x + 4x 2 — 6x 3 ) exp^x — 



+ (2 — 2x — 4x 2 + 10x 3 — 4x 4 ) exp 2 1 x x‘ 



dx 



= e(25x — 33x 2 + 12x 3 ) — Ve(2x + 6x 2 ) exp^x — 



+ (2x — 3x 2 + 2x 3 ) exp“ | x — -x‘ 



= — 3e 
< 0 . 
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Abstract. The paper addresses a problem of robust optimal design of elastic structures when the 
loading is unknown and only an integral constraint for the loading is given. We propose to minimize 
the principal compliance of the domain equal to the maximum of the stored energy over all admissible 
loadings. The principal compliance is the maximal compliance under the extreme, worst possible 
loading. The robust optimal design is formulated as a min-max problem for the energy stored in 
the structure. The maximum of the energy is chosen over the constrained class of loadings, while 
the minimum is taken over the design parameters. It is shown that the problem for the extreme 
loading can be reduced to an elasticity problem with mixed nonlinear boundary conditions; the 
last problem may have multiple solutions. The optimization with respect to the designed structure 
takes into account the possible multiplicity of extreme loadings and divides resources (reinforced 
material) to equally resist all of them. Continuous change of the loading constraint causes bifurcation 
of the solution of the optimization problem. It is shown that an invariance of the constraints under a 
symmetry transformation leads to a symmetry of the optimal design. Examples of optimal design are 
investigated; symmetries and bifurcations of the solutions are revealed. 
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1. Introduction 

A typical structural optimization problem asks for a material layout in the stiffest 
design. The stiffness is defined as an elastic energy of a domain loaded by external 
boundary forces (loading). If the loading is fixed and known, an optimal structure 
adapts itself to resist the loading. However, the optimal designs arc usually unstable 
to variations of the forces. This instability is a direct result of optimization: To best 
resist the given loading, all the resistivity of the structure is concentrated against a 
certain direction thus decreasing its ability to sustain loadings in other directions 
[7, 8, 20]. For example, consider a problem of optimal design of a structure of a 
cube of maximal stiffness made from an elastic material and void; assume that the 
cube is supported on its lower side and loaded by a homogeneous vertical force 
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on its upper side. It is easy to demonstrate, that the optimal structure is a periodic 
array of unconnected infinitely thin cylindrical rods. Obviously, this design does 
not resist any other but the vertical loading. 

The instability to variations of the loading is not a defect of an optimization 
procedure - the structure does exactly what it is asked to do; it is a defect of the 
modeling. In order to find a more stable robust solution, one needs to optimize 
a more general robust stiffness-like functional that characterizes an elastic body 
loaded by unspecified (or partly unspecified) forces on its boundary, as it happens 
with most engineering constructions. To avoid this vulnerability of the optimally 
designed structures to variations of loading, we propose to minimize the principal 
compliance of the domain equal to the maximum of the stored energy over all 
admissible loadings. The principal compliance is the maximal compliance under 
the ex tic me, worst possible loading. We formulate the robust optimal design prob- 
lem as a min-max problem for the energy stored in the domain, where the inner 
maximum is taken over the set of admissible loadings and the minimum is chosen 
over the design parameters characterizing the structure. This formulation corre- 
sponds to physical situations when biological materials arc created and engineering 
constructions arc designed to withstand loadings that arc not known in advance. 

This approach to the structural optimization was discussed in our papers [9, 12] 
and (for the finite-dimensional model) in the papers [18, 19]. Various aspects of the 
optimal design against partly unknown loadings were studied in [1, 5, 8, 21, 25- 
27, 31, 32, 37], see also references therein. In some cases, the minimax design 
problem, where the designed structure is chosen to minimize maximal compliance 
of the domain, can be formulated as minimization of the largest eigenvalue of an 
operator. The minimization of dominant eigenvalues was considered in a setting of 
the inverse conductivity problem in [1 1, 13]. The multiplicity of optimal design that 
we find in the minimax loading-versus-design problem is similar to multiplicity of 
stationary solutions investigated in the engineering problems of the optimal design 
against buckling [14, 34] and vibration [30, 28, 33, 22], 

The structure of this paper is as follows. In Section 2, we introduce an integral 
quantity of an elastic domain, the principal compliance, equal to the response 
of the domain to the worst (extremal) boundary loading from the given class of 
loadings; this quantity is a basic integral characteristic of the domain similar to the 
capacity, the eigenfrequency, or the volume. The principal compliance is a solution 
of a variational problem, which can be reduced to an eigenvalue problem or to a 
bifurcation problem. 

Examples of various constraints for admissible loadings and resulting varia- 
tional problems are considered in Section 3. Particularly, the variational problem 
for the principal compliance with a quadratically constrained class of loadings 
is reduced to the Steklov eigenvalue problem. The principal compliance of the 
domain in this case is a reciprocal of the principal Steklov eigenvalue. We also 
consider the constraints of the L p norm, p > 1, of the loading and inhomogeneous 
constraints and show that the L p norm constraints result in a nonlinear boundary 
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value problem. The constraint of L\ norm of the loading yields to a variational 
problem which does not have a classical solution, but a distribution: the optimal 
loading turns out to be a S -function or, physically speaking, a concentrated loading 
(if such a loading does not lead to infinite energy). 

Section 4 considers robust structural optimization which is formulated as a 
problem of minimization of the principal compliance. The optimal design takes 
into account the multiplicity of stationary solutions for extreme (most dangerous) 
loadings; typically, the optimal structure equally resists several extreme loadings. 
The set of the extreme loadings depends on the constraints of the problem. Contin- 
uous change of the constraints leads to modification of the set of extreme loadings; 
the optimal structure changes in response. This corresponds to bifurcation of the 
solution of the optimization problem. Another characteristic feature of the opti- 
mization problem is the symmetry of its solution. We show that the invariance of 
the set of the constraints for the admissible loadings, together with the correspond- 
ing symmetry of the domain, leads to the symmetry of the optimally designed 
structure. 

Section 5 contains two examples of problems of structural design for uncertain 
loadings. One example is provided by the problem of designing the optimally 
supported beam loaded by an unknown loading with fixed mean value. The sec- 
ond example is a problem of determining the optimal structure of a composite 
strip loaded by a force which deviates from the normal in an unknown direction. 
The force is assumed to have a prescribed normal component and an additional 
component which is arbitrarily directed and is unknown. 

2, The Principal Compliance of a Domain 

2.1. PROBLEM, EQUATIONS, CONSTRAINTS 

2.1.1. Equations 

Consider a domain $2 with the boundary 9 £2 = 9oU9 filled with a linear anisotropic 
elastic material, loaded on its boundary component 9 by a force /, and fixed on the 
boundary component 9o- The elastic equilibrium of such a body is described by a 
system (see, for instance, [35]): 

V • or = 0 in£2, o = C : e, 

1 (1) 
Or = CT T , e(w) = -(VlD + (Vw) T ). 

Here C = C(x) is the fourth-order stiffness tensor of an anisotropic inhomoge- 
neous material, w = w(x) is the displacement vector, e is the strain tensor, o is the 
stress tensor, and (:) represents contraction of two indices. Thus, 

£ ■ cf = ^ C/ or// , (C . c ) i j = ^ (~ ijki Qr ■ 

i,j k,l 
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Equation (1) is supplemented with the boundary conditions 

o-n = f on 3, w = 0 on 9 0 , (2) 

where n is the normal to the boundary 3 £2. These equations arc the first variation 
conditions of the variational problem, 

$(C,f ) = — min ( [ n(C, e(w )) dx — [ w-/ds) 

w:ufao=° \Jq Jd / 

= max ( f w ■ f ds — f n(C, e(w)) dx V (3) 

w:u.|a 0 =0 \J a ) 

where n is the density of the elastic energy: 

n(C,e(w)) = ie:or = ie:C:e. (4) 

The nonnegative functional % is called the compliance of the domain; (3) states that 
it is maximal at equilibrium. At equilibrium, the energy stored in the body equals 
the work of the applied external forces /, 

$o(C, f) = \ [ w-fds= f Tl(C,e(w))dx. (5) 

£ J a Jn 

Simultaneously with the elasticity problem, we consider also a close problem 
of the bending of a Kirchhoff plate (see, for example, [35]). The equilibrium of the 
plate is described by the fourth order equation 



VV : C p i : VVw = f in Q 
with homogeneous boundary conditions 

3 w 

w — 0 on3£2, — = 0 on3£2, 

3 n 

corresponding to a clamped plate, or 

w = 0 on3£2, n J (C p i : W w)n = 0 on3£2, 



(6) 

(V) 

( 8 ) 



for a simply supported plate. Here, w is the deflection orthogonal to the plane of 
the plate, C p i is the fourth-order tensor of bending stiffness of the elastic material, 
VVw; is the Hessian of w , and / is the external loading. Notice that the force / 
enters the equation as a right-hand-side term. The equation for the plate deflection 
corresponds to maximization of the functional 

$pi(C, f) = -f a Q VVu; : C P' : VVw - w f ) dx • (9) 

The results that we develop further in this paper apply to both the elasticity (1) 
and the bending problem (6); therefore, we will drop the subscript in $ p i(C, /), 
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and keep notation $(C\ f ) for both compliance functionals. If this does not cause 
a confusion, we use the same notation w to denote both the displacement in the 
elasticity problem (1) and the deflection in the bending problem (6), even though 
the first one is a vector function, whereas the second one is a scalar function. 



2.1.2. Admissible Loadings 



Let y be a set of admissible loadings /. The elastic energy over a finite domain 
is assumed to be finite. We consider integral constraints to describe the set of 
loadings T : 



y = 




0(/) d,v = 1 



{ 3, for problem (1), 
<2 , for problem (6). 



( 10 ) 



Here D t is a domain of application of the forces: in the elasticity problem (1), 
D f concides with the paid of the boundary 3, whereas for the bending plate prob- 
lem (6), D f is the domain or a paid of it. We assume that 0 is a convex function 
of /, with the derivative i Jr. R 3 — »■ R 3 : 



3 0 

W) = ttt 
OJ 



30 30 30 \ 

WiWiW*)' 



which has an inverse p = 0 1 



2.1.3. Principal Compliance 

We define the principal compliance of an elastic domain in a class of loadings as a 
compliance in the worst possible loading scenario. 

DEFINITION. The principal compliance A of the domain is 

A = max %(C, /). (11) 

The loadings that correspond to the principal compliance A arc extreme or the most 
dangerous loadings’, we denote them as f)> ■ 

A (C) = $(C, f D ) > $(C, f ) V/ e y. (12) 

The most dangerous loadings exist if the set y is closed and convex, see [15]. 



2.2. CALCULATION OF THE PRINCIPAL COMPLIANCE 

The concept of the principal compliance is useful if there arc efficient algorithms 
for computing the extreme loadings. We show here that the problem of computa- 
tion of the principal compliance and the extreme loadings can be formulated as a 
boundary value problem. 
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Consider problem (11) and assume that the loadings arc constrained as in (10). 
The augmented functional J for the problem is: 



J = $(C,f)-n 



(/. 



(p(f)ds - 1 



where q is the Lagrange multiplier. Clearly, max^j: / = max f J . Variation of 
the augmented functional with respect to / gives the optimality condition for the 
extreme loading(s): 

8 f J = f t ?(-/ ' w + V<l>(f)W = 0 . 

J D f OJ 

or, since Sf is arbitrary, 



30 

w — q — = 0 onDr. 

3 / f 



Solving for the extreme loading(s) f D = /, we arrive at the condition 



/d = p(£) (13) 

which links the loading f D to the displacement w at the same boundary point 
for the elasticity problem (1) or at the same point in the domain for the bending 
problem. Condition (13) together with the first boundary condition in (2) allows us 
to exclude / from the boundary conditions, leading to the boundary value problem 
for the displacement w. We arrive at: 



THEOREM 1. The principal compliance A of the elasticity problem (1), (2) with 
the constraints for the class of loadings (10) equals 




where w satisfies the elasticity equations (1) in O with the boundary conditions 

o-n = p( — w) ond, w = 0 on do- (15) 

\T ) 

The Lagrange multiplier q, is determined from the integral condition 




where the function p(-) is an inverse of = 30/3/. 
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Indeed, the displacement w, whose energy is the principal compliance, satisfies 
the elasticity equations (1) in Q with the boundary conditions obtained from (2) 
and (13). The first condition in (15) relates the normal stress at a point on the 
boundary 3 to the displacement at this point. The boundary value problem (1), 
(15), (16) allows us to compute w and /x, f D , and A. 

For the bending problem (6), the calculation is similar. The principal com- 
pliance is the maximum of the functional (9) over all loadings bounded by the 
constraint (10); its value is the following. 

THEOREM 2. The principal compliance A for the bending problem (6)-(8) with 
the constraint for the class of loadings (10) is 




where w satisfies the equation 

VV : C p i : VVu) = ( 18 ) 

together with the corresponding homogeneous boundary conditions (7) or (8). The 
function p (•) is an inverse off = 3 f / 3/. The Lagrange multiplier p is determined 
from 




Indeed, the extreme loading / is related to the displacement w by a scalar 
relation w = pf'if) or / = p(w / p), and the plate equlibrium is described by 
equation (18). 



3. Examples of Constraints 

3.1. HOMOGENEOUS QUADRATIC CONSTRAINT 



Assume that the constraint (10) restricts a weighted L 2 norm of /: 



1 

2 



L 



f T Vfds = l 



or </>(/) = i/>f, 



( 20 ) 



where T (.v) is a symmetric, positive matrix. In this case, p is a linear mapping: 
p{f) = /, and the first of the boundary conditions (15) for the extremal 

loading becomes linear: 



1 1 

— 91 w — a ■ n = 0 on 3. 

p 



( 21 ) 



The optimality condition states that w and o ■ n arc proportional to each other 
everywhere on the boundary 3 with the same tensor of proportionality /r T. 
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REMARK 1. The stationary condition (21) allows for the following physical inter- 
pretation: The boundary 9 is equipped with distributed springs with negative stiff- 
ness. The forces in them arc proportional but opposite to the forces in conventional 
lineal - springs. 

The elasticity equations (1) with boundary conditions (21) form a linear eigen- 
value problem that has a nonzero solution w only if I / p is one of its discrete 
eigenvalues. Eigenvalue l/p relates the displacement on the boundary and the 
normal stress. 

As all eigenvalue problems, the problem (1), (21) represent Euler-Lagrange 
equations of a variational problem: 

1 f 0 e(w) : C : e(w) ds 

— = min ; 

pi »>:ui|a=0 J a w ■ ds 

or 

( [ e(w) : C : e(w) dx f w ■ ib^wds ) — min . (22) 

\Jsi AJd ) w:u ’b=° 

The eigenvalue problem that contains the eigenvalue in the boundary condition is a 
Steklov eigenvalue problem , and p is a reciprocal to the Steklov eigenvalue, see [4], 
The eigenfunctions are normalized by condition (20). 

Using (20) and (21) in the form w = p T/, we observe that the second term 
in (22) is equal to p , thereafter p = A. The Steklov problem has infinitely many 
real positive eigenvalues (see [4, 23]), but the principal compliance of the domain 
corresponds to the dominant eigenvalue, A = /x max . The dominant eigenfunction 
is not necessarily unique; we will demonstrate below that the existence of many 
stationary solutions is typical for the problems of minimization of the principal 
compliance with respect to the structure. The dominant eigenfunctions are the 
extreme loadings. The results are formulated as 

THEOREM 3. If the L i -norm of admissible loadings is bounded, the principal 
compliance A is a solution of the eigenvalue problem: 

V • o = 0 in Q, w = A^kcr • n on 9. (23) 

A is a reciprocal to the principal eigenvalue l/p of the problem (1), (21). 

REMARK 2. The spectrum of the problem (1), (21) has one condensation point, 
zero. Positive eigenvalues px tend to zero but never reach it. This implies that the 
dual problem of minimal compliance does not have a solution: the compliance can 
be made arbitrarily small by choosing a fast alternating loading. 

REMARK 3. The problem becomes isomorphic to the problem of the principal 
eigenfrequency of the domain, if the kinetic energy (and the inertia) are concen- 
trated on the boundary: T = S(x — x h )vj'I l w, where x b e 9. 
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In the bending problem (6), the analogy between the principal compliance and 
the principal eigenfrequency of vibrations is complete. The equilibrium (18) of 
the optimally loaded plate coincides with the equation for the magnitude of the 
deflection of the oscillating plate, 

1 

VV : C pi : VVw) = — w. 
p A 



3.2. Lj-NORM constraint 

Consider the Li-norm constraint for the class of admissible loadings which as- 
sumes that the mean value of loading’s magnitude is fixed: 

[ I / 1 ds = f \fT~f ds = 1 . (24) 

Jd Jd 

From an engineering viewpoint, this case is probably the most interesting one: it 
models the situation when the total weight applied to the structure is known but the 
distribution of the loading over the boundary is uncertain. 

For this, the functional of the variational problem grows linearly as |/| — »■ oo 
which leads to a significantly different analysis. The straightforward variational 
technique does not provide the correct answer. Indeed, the variation with respect 
to / returns the vector condition 

1 

8f: w — ji f = 0 on 3, 

VT7 

which says that 

| w | = constant and w\\f on 3. 

The last condition, together with the condition a ■ n — f (see (2)), allows us to 
exclude / and end up with a pair of conditions on w: 

(a ■ n) x w — 0, |w| = constant on 3. 

Generally, these conditions cannot be satisfied if the 3 -component of the bound- 
ary is adjacent to the component 3 0 where w = 0 since w is continuous. This 
contradiction shows that the naive variational method does not apply. 

REMARK 4. The appearance of discontinuous solutions in the variational prob- 
lems of lineal - growth is well-known [36]. The famous classical example is the 
existence of a non-smooth solution in the minimal surface problem. 

To solve the contradiction, we need to assume that the optimal loading / is a 
distribution. Indeed, the distribution does not have to satisfy the Euler equations 
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of the variational problem because this equation was derived under the assumption 
that the optimal solution / is finite and smooth. 

Dealing with distributions in the L i -constrained set of loadings may cause dif- 
ficulties because the distributions 8 (x — xo) may or may not correspond to a finite 
energy of the elastic system, as is stated in the Sobolev embedding theorem, see, 
for example, [24], For the compliance of the bending plate (9), the energy of the 
concentrated loading and the Green’s function of the corresponding operator arc 
finite. We illustrate this case below considering a one-dimensional example of a 
beam; the concentrated loadings of the type <5 (x — x 0 ) are acceptable because the 
corresponding energy stored in the elastic beam is finite. 

However, the linear elasticity problem does not allow a concentrated loading 
because the corresponding energy is infinite; the Green’s function g(x, y) has a 
singularity, g(x,x) = oo. In this case, the restriction on the class of admissible 
/ can be slightly tighten. We may assume, for example, that the force is piece- 
wise constant within small domains of area e. Alternatively, we may constrain the 
L i +f -norm of the loading, 

\f\ l+€ ds = l, (25) 

where e > 0 is a fixed parameter. This loading can be supported by a I i near elastic 
material, although the displacement w can indefinitely grow when e — »■ 0. The 
analysis of this case leads to the optimality condition 

w 

M’ 

which shows that magnitude of an optimal loading either stays arbitrarily close to 
zero or is very large (of the order of 1 /e). The integral constraint (25) guarantees 
that the measure of the set of large values of f(s) goes to zero when e — > 0. 

With this w arning, we proceed with the formal analysis of the problem with the 
L i constraint assuming that either the limit exists or that e can be chosen arbitrary 
close to zero to preserve the qualitative properties of the solution. 

The extremal loading is concentrated in several points, 

/ = Y^c.^Six -Xj), 

i 

where {x,} is the set of points where the (concentrated) loading is applied, x,- e 3, 
= (£ ; a) , £ ( . <2) , £ ( . (3) ), \^i | = 1, arc directional vectors of the concentrated 
loadings, and c, arc their intensities; due to (24), c,- belong to the simplex 

c, : = 1, a ^ 0. (26) 

i 

Further, we show that the extreme loading is always applied to a single point. The 
displacements uy = w(Xk) are 

Wk = 
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where g(xk, x ,) is the Green’s function which relates the ^-function loading at the 
point Xj to the generated displacement w at the point x*. The compliance becomes 

$ = Yh 'Yh C i C k{^8(Xi,X k)Hk)- 

i k 

The principal compliance corresponds to the maximum of $ with respect to c, , £,■ 
and the points x,- . 

As a function of c, , # is a nonnegative quadratic form, because the work $ is 
always nonnegative. Therefore, $ is a convex function of c, and its maximum is 
reached in a corner of the simplex (26): the maximum $ c of $ corresponds to a 
single concentrated loading c\ = 1 , C 2 = ■ ■ ■ = c p = 0. Next, we maximize 
this maximum §, c with respect to the direction f = (f 1 £j 2) , f J’ 1 ) of the single 
applied loading. The resulting compliance is equal to the maximal eigenvalue 
kmax (x i ) of the Green’s function g(x i, x\) at the point x — x\\ 

= max(§i r g(xi,xi)^ 1 ) = ^(xi). 
fi 

This implies that the applied loading /(x) must be parallel to the displacement 
w(x). Finally, we choose the point xi 6 3 of application of the extreme concen- 
trated loading and obtain the principal compliance A. Summarizing, we obtain 

THEOREM 4. The Li-principal compliance is 

A = max{/4, x (x)}. 

xea 

where An, ax (x ) is the maximal eigenvalue of the 3x3 tensor Green’s function 
g (x , x ) of the problem (1) at the point x G 3. 

We stress that the point X| may be not unique although the extreme loading 
is always concentrated at one point. For example, there may be two symmetric 
extreme loadings if G is a symmetric domain. An example in Section 5.1 below 
shows that there are several equally dangerous loadings in an optimal solution: 
/-max (x i ) = • • • = /-max (x q ) ; the number q depends on the structure. 



3.3. OTHER SPECIAL CASES 

3.3.1. Constrained L p -norm of the Loading 

If the constraint is imposed on the Lp-norm of the loading, i.e., 



~[\f\ p = l, P> h 

P Jd 

the problem has the form (1) but the boundary conditions (21) arc replaced by 

/ M\ 1/(p-1) w 

a - n = q(w), q(w) = ( ) — (27) 



/x 



\w\ 
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and the normalization (16) for fi becomes 

/ 1 f \ l/q 11 

ji = ( — / \w\ q ds ) with — | — = 1. (28) 

\p Ja ) q p 

In this case, the relation between the stress and displacement is nonlinear. Again, 
the multiplicity of stationary solutions that satisfy (27), (28) is expected; this time 
the solutions correspond to bifurcation points instead of spectrum points. The phys- 
ical interpretation is similar to the one given in Remark 1 , but the springs attached 
to the boundary 9 arc nonlinear. 

3.3.2. Nonhomogeneous Constraint 

Let the loading / consist of some known component f° and an unknown deviation 
with a constrained L p -norm: 

II/°-/IIl p <1. (29) 

Applying the previous variational analysis, we conclude that an extremal loading 
can be found from the elasticity problem with a inhomogeneous mixed boundary 
condition: 

a ■ n — f° + q(w) on 9. 

Since the boundary condition is inhomogeneous, w = 0 is not a solution. Still, the 
problem may have several stationary solutions. An example of this constraint is 
discussed later in Section 5.2. 

4. Robust Optimal Design 

4.1. MULTIPLICITY OF EXTREME LOADINGS 

Consider an optimal design problem: find a layout of elastic materials over the 
domain C that minimizes the principal compliance A. Such a structure (stiffness 
C(x)) corresponds to a solution of the extremal problem 

Pmm max = min A (C) , (30) 

C se 

where C is a class of admissible layouts. We rewrite the problem using the defini- 
tion of A(C): 

Pmm max = min max ft (C, /) , (31) 

CsC feF 

where the compliance ft = ft(C, /) is defined in (3). Minimization over w in (3) 
is performed first so that w will satisfy the elasticity equations while interchang- 
ing the order of the extremal operations min Cse and max / g j- correspond to two 
physically different situations. Minimax problem (3 1) is a problem of optimization 
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of the material layout when the applied loading is unknown, while in the maximin 
problem 

^maxmin = max min ${C, /) (32) 

feF CeC 

the loading is chosen to maximize the stored energy and is known to the designer; 
so the design resists this particular loading. If f is a saddle-point functional, the 
solutions to these two problems coincide, and 

P — P 

1 max min — 1 mm max* 

Saddle point solutions arc typical for ‘weak’ control as we will demonstrate below. 
The general case 

^max min < Pmin max 

corresponds to a situation when several loadings arc ‘equally dangerous.’ The stiff- 
ness of the structure C opt should be fairly distributed to resist equally well each of 
these extreme loadings leading to the condition 

$(C op u fi) = $(C opt , fj), fi, fj e O, 
where d> is a set of extreme loadings. 

Generally, the set of stationary loadings may consist of any number of elements. 
They can be found from the following equations, see [16]. Consider a design C opt 
and the functional $(C opt , /). The extremal loadings that solve the variational 
problem 

^(Copt, /) = o, c opt , /) < 0 

arc denoted by fj, i = 1 where p ^ oo; we assume that there arc p 
stationary loadings that can become extreme. The optimized principal compliance 
P m in m ax is determined from the problem 

min max( ^minmax T" ^ E ( $(C op t, fj) ), (33) 

C V, ^0 \ / 

l x 

where v t ^ 0 are the Lagrange multipliers due to the constraints 

$(Copt > fi) P min max ^ 0? ^ ^ L' = 1* 

i 

Optimal design C opt is found from the following conditions that reformulate the 
minimax problem as the problem of minimization of a sum of energies correspond- 
ing to extreme loadings. 
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THEOREM 5. The optimal principal compliance P m ; n max equals 

Pmin max = min max T Vi$(C, fi), 
c sC {v,}:v,> 0^— ' 

1 = 1 

where q is the number of active extreme loadings. 

The nonzero Lagrange multipliers correspond to the equalities 

$o = $(Oopt> fi') i = 1 >•••><?> = 4 > 'f > 0, 

and the multipliers equal zero if the stationary loading leads to a smaller value of 
the functional, i.e., 

$o > $(C opt , fk), k-q + \,...,p =>• Vk — 0. 

These last conditions should be checked in the optimization procedure; that is, 
minimizing $ 0 we check if the value of the functional for the next loading f q+l 
(not the most dangerous one) is still less than # 0 . When this inequality becomes 
equality, the set of extreme loadings should be enlarged to include f q+ \, and the 
corresponding Lagrange multiplier v q+ \ becomes positive. 

The multiplicity of equally dangerous loadings closely resembles the multi- 
plicity of optimal solutions in a well studied problem of maximization of the 
minimal eigenfrequency. The multiplicity of optimal eigenvalues in that problem 
was observed first in a pioneering paper of Olhoff and Rasmussen [30] ; then it was 
investigated in [33, 14, 34], 

REMARK 5. The optimization problem (34) also admits a probabilistic interpre- 
tation. Namely, assume that the optimal loading is a random variable which takes 
q stationary values with some probability rq, . . . , v q . Then the sum r>,#(C, f ) 
in (34) is the expectation of the energy. The optimal design minimizes the ex- 
pectation of the energy, meanwhile the loading chooses probabilities v lt . . . , v q to 
maximize it. 



X> = 1, (34) 



4.2. SYMMETRIES 

Symmetries arc typical for designs that minimize the principal compliance. Namely, 
if the domain and the class of loadings arc invariant under a symmetry transforma- 
tion (translation, reflection, or rotation), then the set of extreme loadings and 
the optimal design arc invariant under this transformation as well. We state the 
following: 

THEOREM 6. If the domain O, the boundary component 3, and the set IF of 
admissible loadings are invariant under a symmetry transformation -P, i.e., 

£2 = <R£2, 3 = Jid, and F~ = SlF r , 
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Figure 1. The force could be applied at arbitrary points along the elastically supported beam. 
The mean value of the magnitude of the force is constrained. 



then the set of extreme loadings and the optimal materials’ layout C are invari- 
ant under this transformation , /.<?., 

$ = #<&, C = IRC. (35) 

Indeed, applying the above consideration we can see that if f 0 e O is an extreme 
loading, then -R / 0 is also an extreme loading. The compliance of the structure 
should be the same for both loadings, which implies invariance of the design para- 
meters with respect to the transformation -R. Particularly, when the loaded domain 
is rotationally symmetric, and the loading can be applied from any direction, the 
optimal layout is axisymmetric. 

REMARK 6. Notice the symmetry of many natural ‘designs' that are perfected 
by evolution: The rotationally symmetric shape of trees allows them to sustain 
wind from all directions; our natural “protective shell”, the skull, provides the best 
protection for the brain against hits from any direction. 

The conditions of the theorem do not require the symmetry of the extreme load- 
ing, only a possibility to apply a loading symmetric to any given one. In contrast, 
the design must be symmetric. 

5, Examples of Optimal Designs 

The following examples highlight the discussed multiplicity of extreme loadings 
and bifurcation of the optimal solution. 

5.1. OPTIMAL DESIGN OF A SUPPORTED BEAM 

5.1.1. Formulation 

Consider a homogeneous elastic beam of unit length simply supported at both ends, 
elastically supported from below by a distributed system of elastic vertical springs 
with the specific stiffness q(x) f 0, and loaded by a distributed nonnegative force 
f{x) f 0. The elastic equilibrium of the displacement w is described by a one- 
dimensional version of (6): 

(£»")" + qw = f, 



w( 0) = w(l) = 0, 



w"(0) = w"( 1) = 0, (36) 
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where E is Young’s modulus. The compliance is equal to 

# = l\fw - | (w"f - | w 2 ^j dx, (37) 

where w is a solution of (36). Assume that the mean value of the magnitude of 
the loading (Li-norm constraint) is equal to one, and the integral stiffness of the 
supporting springs is constrained by a constant k. 

F = {/eff-^O.l): J \ dx = lj, 

a = jr/ G H~ l ( 0, 1): jf q dx = /cj. 

The optimal design problem of minimization of the principal compliance by dis- 
tributing the springs stiffness becomes: 

T’minmax = min ( max#). 

qeQ Xf-.eF ) 

Applying the above analysis, we conclude: 

1. The domain, class of loadings and the boundary conditions arc invariant to the 
translation x — > 1 — x, therefore the design (the springs stiffness) is symmetric 
with respect to the center of the beam, see Section 4.2, 

q(x) = q{\ -x). 

2. Necessary conditions in Section 3.2 show that the extreme loading is a delta- 
function /(x) = <5(x — Xj) applied at one of the points {x \ , x%, . . . , x p ], where 

w'(xj ) = 0, w"(xj) ^ 0. (38) 

The extreme loading may be applied to different points symmetric with respect 
to the center of the beam; the resulting stiffness must be equal. 

3. The stiffness of an optimal spring is a distribution 

q CO = y^M(x - yd, ^Q'/ = k, oc t ^ 0. 

i i 



Indeed, the assumption that q (x) satisfies variational stationary conditions leads 
to a contradiction si mi lar to the contradiction discussed in Section 3.2. Particu- 
larly, the optimal positions of the springs satisfy the necessary conditions (38), 
and therefore the set of reinforcement points coincides with the set {x\ , X 2 , 
. . . , x p }. The number p of the critical points depends on the relative stiffness 
of the springs k/E. 
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Accounting for the loading and springs being concentrated, we reformulate the 
problem (37) for the optimal principal compliance: 




where S ik is Dirac function. 

The response of a supported beam can be characterized by a function 

v(x ) = max g(£,x), (40) 

f 6(0.1) 

where g is the Green’s function of the boundary value problem (36): g(£, x) is the 
displacement ut(f) at the point £ corresponding to a delta-function loading applied 
at the point x, /(£) = 8 (£ — a), and v(x) is the maximal displacement under the 
concentrated force applied at the point x. Figure 2 shows the response v(x) of the 
beam supported by two symmetric springs. The family of the thin curves shows 
the displacements w k (x) under several concentrated loadings applied at different 
points along the beam. The thick curve shows the maximal displacement, v(x). 
Notice that the point of application of the concentrated force is generally different 
from the point of maximum of the displacement curve; see the caption to Figure 2. 
However, the optimal springs arc located at points x' )pt , i = 1, 2, of the maximum 
of v(x), and the extreme loading is the one applied at one of the same points, 
f D = S(x- Ao pt ) or f D = S(x - xl pl ). 

The numerical results demonstrate the following: if the springs arc weak, k/E ^ 
k i, they arc concentrated in the center of the beam. We arc dealing with the saddle- 
point case: the most dangerous loading is a concentrated loading applied also at the 
center. The maximal displacement v(x) is a unimodal function of the position of 
the loading, with the maximum in the center, (v'( I /2) = 0, v"(\/2) < 0). There 

o 

0.05 

c 
CD 

| o.i 

_n5 

Q_ 

U3 
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0.2 

Figure 2. Thin curves: The displacement functions generated by concentrated loadings ap- 
plied at various points along the beam. The thick curve: maximal displacement v(x) generated 
by a force applied at x e (0, 1) as a function of the position of the force. The displacement 
corresponding to the force applied at x = 0.15. has a maximum at a = 0.25. Figure shows the 
responses of the beam optimally reinforced by two symmetric springs. 
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(c) 

Figure 3 . Maximal displacement n(.t) as a function of the position of the applied loading: 
(a) corresponds to a saddle point case, k/E < K\ : the function u(.r) is unimodal, the optimal 
spring and the extreme loading are both located in the middle of the beam; (b) shows v(x ) 
corresponding to k/E in the interval k\ < k/E < K2 when the strong spring is located in 
the center of the beam. Maximal displacement u(.r) is not unimodal; design is not optimal; 
(c) corresponds to k/E in the same interval K\ < k / E < K2, the maximal displacement v(x) 
is shown for an optimally designed beam which is supported by two symmetric springs. 



is only one solution for the optimal applied force and the optimal position of the 
spring: 

f( x )=S^x-^j, q(x) = k8^x - 
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Figure 3(a) shows v(x) for the beam supported by a weak spring in the center 
of the beam. One can see that v(x) is unimodal. If the spring becomes stronger, 
ki < k/E ^ at 2, but is still located in the center, the maximum of v(x) corresponds 
to a noncentral applied force. The equally dangerous loadings could be applied 
in two symmetric eccentric points. The maximum displacement v(x), shown in 
Figure 3(b), is not a unimodal function of the position of the moving applied 
force; the design is not optimal. The optimal design for this case (Figure 3(c)) 
corresponds to two equally stiff springs located symmetrically with respect to the 
center; the design experiences a bifurcation at the critical value of k/E = K\ . 
An optimally supported beam is shown in Figure 3(c), where two strong springs 
arc located symmetric with respect to the center of the beam. The maximal dis- 
placement curve becomes unimodal again, with a large interval of almost constant 
values in the middle. The next bifurcation occurs when k further increases, at the 
point k/E = k 2 . Three springs appeal - after the next bifurcation. The number of 
optimal supporting points increases and tends to infinity when the springs are much 
stronger than the beam, k/E 1. The optimality conditions 

w'(. Xj) = 0, w(xi)\f = f t = constant (i), 

give the optimal position of the supporting springs x,- and a requirement on their 
stiffnesses a,-. 

5.2. COMPOSITE STRIP WITH CONSTRAINED DEVIATION OF THE LOADING 

This example shows the design of an optimal structure for the worst possible load- 
ing. Consider an infinite strip £2 = {— oo <x < oo, — 1 ^ y A 1}. made from a 
two-component elastic composite with arbitrary structure but with fixed fractions 
m A and m B = 1 — m A of the isotropic components. The stiffness of the composite 
C(x,y ) is an anisotropic elasticity tensor; it is assumed that the stiffness can vary 
only along the strip, C = constant (y). 

Assume that the upper boundary is loaded by some unknown but uniform load- 
ing /, 

cr(x, 1) • N — f Vx, 

where N = (0, 1) is the normal vector. The loading / consists of the fixed compo- 
nent /o = (0, 1) directed along the normal and a variable component (deviation) 
( f N , f T ) , the magnitude of the deviation is constrained: 

/ = (/o + f N )N + f r T, f N + rf = y 2 . (41) 

Here T = (1, 0) is the tangent vector and y is the intensity of the deviation. The 
constraint (41) can be rewritten as 

/ = (/o + y cos Q)N + (y sin Q)T fory = l, 
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Figure 4. An infinite composite strip loaded by a force / that could deviate from the normal 
direction. If the norm y of the deviation is smaller than a critical value y\, the optimal com- 
posite is a laminate with layers directed across the strip. If y is greater than j/j, the optimal 
composite is second rank laminate with layers oriented along directions cp and —(p. 

where 9 is the angle of inclination of the deviation of the loading; see Figure 4. The 
lower boundary of the strip is assumed to be loaded by a symmetrically deviated 
force 

/_ = -/ = -(/ 0 + y cos 9)N + (y sin (~9))T for y = -1. 

The symmetry of the loadings results in the horizontal strain being zero, 

e.xxix, y) - 0, -1 < y < 1, (42) 

so that the strain tensor has only two, vertical and shear, nonzero components. The 
stiffness of the composite C (x) is an anisotropic tensor that is assumed to vary only 
along the x coordinate. We consider the problem of optimization of the principal 
compliance of the described domain. 

5.2.1. Design Parameters 

Applying the symmetry theorem, we conclude that: 

1. The elastic properties of the optimally designed structure do not vary along the 
strip, since the design is invariant to the translation x — > x + y. Together with 
the assumption that the material properties do not vary with the thickness, this 
leads to the conclusion that the elastic properties arc uniform: the tensor C is 
constant in x and y. This implies that the stress field a is constant inside an 
optimal strip and 

c T yy = 1 + y cos 9 , a xy = y sin 9 . (43) 

2. The material in the optimal strip is orthotropic with main axes directed along 
the x and y axes since the design is invariant to the reflection x — > — x: 
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This implies orthotropy with the main axes codirected along the x, y axes. 

ci j — Sjj ) 



For the following calculations, we introduce an orthonormal ( a,- 
tensor basis 



a\ = 



0 0 



«2 = 



0 1 

In this basis, the stress tensor a , 



1 0 
0 0 



a = 



°2 

ay o\ 



is represented as a vector 

a = o\a\ + ooro + V2o 2 ,a 2 . 



1 /O 1 

a3 ~V2\l 0 



(44) 



The compliance tensor S and stiffness tensor C = S 1 arc presented as matrices 
with the components { S/j } and {C,y}; their orthotropy implies the representation 

/Su S\ 2 0 \ 

5 = 5 12 5 22 0 

\ 0 0 SyJ 

and a similar one for C. 



5.2.2. The Optimization Problem 

The energy n of an orthotropic material is computed either as a function of stresses 
and compliance tensor S = { Sjj } (stress energy): 

n ff (S, a) = ^(S n cr{ + S 22 cr | + 25 12 a,a 2 + 2 5 33 ct 3 2 ), (45) 

or as a function of strain e and stiffness tensor C = {C ;/ }, 

n € (C , e) = + C 22 e \ + 2Ci 2 eie 2 + 2C 22 e^). (46) 

Recall (see (43)) that two components o\ = a yy and o 2 = a xy of the stress field 
o arc known, and the strain in the xx direction is zero, (42): 



e 2 — 5'i 2 o'i + S 22 <y 2 — 0; 

therefore, ct 2 can be excluded. The elastic energy (46) becomes 

n e (C,e) = ^(C 11 c? + 2C 33 e|) 
or, in terms of stress (see (45)), 

n ff (5, cr) = +2S 33 o^j. 
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The problem of robust optimal design becomes 

P strip = min max n (5, cr), (47) 

CeG m closure feF 

where G m closure is the set of all possible effective compliance tensors of a mi- 
crostructure formed from the two given materials with the compliance tensors S A 
and S B , taken in the proportion m A and m B = 1 — m A , respectively, see [8, 27]. We 
reformulate the problem using a sum of weighted energies, where the minimized 
functional is taken as a sum of the energies due to the extreme loadings. 



5.2.3. Laminates of Third Rank: Symmetry 

The description of the strongest structures that minimize the sum of the energies 
due to several loadings is known, (see the original papers [2, 3, 17] and the books 
[8, 29]); the best structures in 2D arc so-called “laminates of the third rank” shown 
in Figure 5. In 3D, they arc the sixth rank laminates [17]. Structural optimiza- 
tion based on using the third rank composites was effectively developed for the 
multi-loadings case in [6, 10, 25]. The effective compliance tensor S = C -1 of 
a third rank composite - the symmetric fourth-order tensor of elasticity - has the 
representation 

S = S A + m B ((S B - S A )~ l + m A N)~\ (48) 

where S A is the compliance of an enveloping (reinforcing) material, Sb is the com- 
pliance of the material in the nucleus, N is the matrix of structural parameters that 
depends on the structure of the composite, see [8, 29], 

3 3 

N - E a y^ci'i-P (</>,), - 1, cti ^ 0. 

; = l 1 = 1 

Flere E A is the Young’s modulus of the A -material, angles cf arc the angles that de- 
fine the directions of laminates (directions of reinforcement), P is a tensor product 
of four directional vectors z, = (cos (f , sin r/>, ) : 




Figure 5. The schematic picture of the composite of the third rank. 
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P((pi) = Zi ® Zi ® Zi ® Zi, 



(49) 



a, is the corresponding relative thickness of the reinforcing layer in the i th direc- 
tion. 

The above mentioned symmetry of an optimal composite requires the orthotropy 
of the optimal structure. Since the original materials arc isotropic, the structure is 
orthotropic if the matrix N is orthotropic. This can be achieved by setting 

02 = -03 = 0, a 2 -a 3 - a. 

Generally, the optimal strip is reinforced by three layers of strong material; one 
layer (with relative volume fraction 1 — 2a) is directed in the y-direction and two 
other layers (with equal relative volume fractions a) arc symmetrically inclined by 
the angles ±0. In addition, the structure may degenerate into a single layer (when 
a = 0) or two symmetric layers (when a = \) with angles 0 and —0. Because of 
this symmetry, the matrix N for an optimal composite becomes 

N = (1 - 2a)P(0) + aP(</>) + aP{-(p). (50) 



Let us compute the compliance of a third-rank composite in the basis (44). 
Compliance Sa of an isotropic material A is given by a matrix 



5a = 



1 + Va 

E a 



1 - V A -V A 0 \ 

-V A 1 -v A o ) , 
0 0 1 / 



and similarly for the material B. To compute the effective compliance of a third- 
rank laminate, we first represent the matrix P(cp) of (49) in the basis (44), 

( cos 4 0 sin 2 0 cos 2 0 \f2 sin 0 cos 3 0 

sin 2 0 cos 2 0 sin 4 0 \[2 sin 3 0 cos 0 

\/2 sin 0 cos 3 0 \/2 sin 3 0 cos 0 2 sin 2 0 cos 2 0 

and obtain from (50) 

— 2a + 2a cos 4 0 2a sin 2 0 cos 2 0 
2 a sin 2 0 cos 2 0 2 a sin 4 0 

0 0 

The matrix N is the variable paid of the compliance matrix, (see (48)); it depends 
on only two scalar parameters, 0 and a. 

The structural optimization problem (47) finally becomes an algebraic problem 



0 

0 

4a sin 2 0 cos 2 




./ s trip = min max n CT (S(0, a), cr(0)); (51) 

(j),a 0 

the expressions for the quantities involved arc described above. The angle 9 is 
the angle of deviation of the loading from the normal, and 0 and a arc structural 
parameters. 
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5.2.4. Second Rank Structure is Optimal 

Although in the general case of minimization of a sum of energies corresponding to 
multiple loadings the third-rank laminates arc optimal, here the optimal structures 
arc the second - not the third-rank laminates. To prove this statement we must find 
the derivative of Y[ n in the algebraic minimization problem (51), and demonstrate 
that it does not become zero; this would give the optimal value of a on the boundary 
of the constraint. However, we skip this bulky calculation and give a physical ar- 
gument supported by results of numerical optimization. Because of the absence of 
a displacement in the ^-direction, there is no need to reinforce this direction. Even 
more, the stress in the composite does not change if a layer with infinite stiffness 
oriented along x-axes is added to the composition. If this infinitely stiff layer is 
counted, then the structure would be reinforced by three layers of stiff material. 
Since the stiffness of a structure with an infinitely stiff layer is not smaller than 
the stiffness of a structure without such a layer, the optimality of the second-rank 
laminates follows. 

This conclusion is supported by results of numerical optimization, which gives 
o! 0 p t =1/2 for all settings. Physically, this means that the optimal structure is rein- 
forced by either single laminates oriented across the strip (the case when 0 = 0) or 
by a second-rank laminate with two symmetric reinforcement directions 0 and —0, 
see Figure 4. This degeneration of the third-rank laminates can be explained by the 
special geometry of the strip and the loading, which do not allow for any strain e xx 
along the strip, and the assumed independence of the design on the y-coordinatc. 
The formulas for the effective properties of a symmetric second-rank composite 
arc simplified: They arc still given by the expression (48) but the structural matrix 
N is 



N = i(P(0) + P(-0)) 

instead of (50); in the basis (44) it has the form 

( cos 4 0 sin 2 0 cos 2 0 0 

sin 2 0 cos 2 0 sin 4 0 0 

0 0 2 sin 2 0 cos 2 

We notice that the symmetry in this example efficiently reduces the dimension of 
the computational problem, but the general method works with or without symme- 
try. 

5.2.5. Numerical Example 

For the first example, the following values of parameters were chosen: 

mA = 1 — m B — 0.2, E a — 1 , 

VA = v b = 0.3, /o = 1. 




E b = 5, 
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Norm of deviation of the loading from the normal direction 



Figure 6. Bifurcation diagram shows (1) the angle of deviation 6(y) of the extreme, most 
dangerous loading and (2) the angle ij>(y) °f optimal reinforcement of the second rank lami- 
nated composite. Notice that the bifurcation parameter y has different critical values for the 
deviation of the loading 9 and for the angle of reinforcement (f>. 



The relative magnitude y of the variable part of the loading is the parameter of 
the problem; the angle 0 of the optimal deviation of the extreme loading and the 
structural parameters a and e/> arc determined from the solution of the min-max 
optimization problem. We detect three regimes: 

1. When y < yo — 0.31, the extreme loading is vertical, 0 opt = 0, and the optimal 
structure is a laminate with vertical layers directed across the strip, 0 opt = 0, 
see Figure 6. 

2. At the critical value y 0 of the parameter y , the direction of the extreme deviation 
undergoes a bifurcation, 0 opt = ±0(y), shown by the curve 1 in Figure 6. But 
for y < y 1 — 0.46, the optimal structure remains the same: a laminate with 
layers directed across the strip, <p opt — 0 (curve 2 in Figure 6). 

3. When the magnitude y further increases, y ^ yi, the optimal structure bifur- 
cates as well; it becomes a second-rank matrix laminate with the angle <p opt = 
±0(y) (curve 2 in Figure 6). 

Although the problem has two solutions for the extreme loading, the depen- 
dence of the compliance on the parameters 4> and 6 is a saddle -point surface as is 
shown in Figure 7. Indeed, the problem is reformulated (relaxed) accounting for 
non-uniqueness of the loading and for the symmetry in the design. 

The following examples demonstrate the dependence of the optimal solution 
on the ratio of Young’s moduli for the materials in the composite. Figure 8 shows 
the bifurcation diagrams for different ratios of Young’s moduli. Qualitatively, the 
picture remains the same, but the critical values of the bifurcation parameter y are 
different: The larger the ratio, the smaller the critical value of y 0 and y\ at which 
the bifurcation occurs. The interval (y 0 , yQ decreases with an increase of the ratio 
of Young’s moduli. 
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Figure 7. Energy stored in the composite is a saddle point function of the angle of deviation 
of the loading 9 and of the direction of reinforcement 




(a) 




Norm of deviation of the loading from the normal' direction 



(b) 

Figure 8. Bifurcation diagram for different ratios of Young's moduli of the materials in the 
composite ranging from 1 : 2 to 1 : 25. (a) Bifurcation of the angle 9(y ) of deviation of the 
extreme loading from the normal, (b) Bifurcation of the angle of direction of the optimal 

reinforcement for the second rank laminated composite. 
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5 . 3 . DISCUSSION 

The principal compliance is a basic characteristic of an elastic body which de- 
pends only on the shape of the domain and on the stiffness of the material. By 
the proper normalization of A using ||£2|| and ||C||, this quantity is reduced to the 
dimensionless parameter X: 

A 

ii nil lien’ 

and can be treated as a basic integral characteristic of the tilled domain along with 
such properties as main eigenfrequency, the capacity, etc. 

The optimal design aimed to decrease the principal compliance is a minimax 
problem; typically, the problem does not have a saddle point and the optimal de- 
sign provides equal minimal compliance for several extreme loadings. Symmetries 
and relaxation bring the problem to a saddle -point type. Depending on the type of 
constraints, the extreme loading can be a principal eigenfunction of an eigenvalue 
problem, a concentrated loading, or a solution of a bifurcation problem. 
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Abstract. The experimental determination of a strain energy function W for a rubber specimen 
must address departures from an elastic ideal in a rational fashion. Herein, such a rational exper- 
imental method is developed for biaxial stretching experiments and applied to rubber data in the 
literature. It is shown that Rivlin’s representation formula is experimentally ill-conceived because 
experimental error is magnified to the extent that error obscures trends in the response function 
plots. Upon developing direct tensor expressions for the response function calculations, we show 
that Rivlin’s representation formula (or any such constitutive law that has high covariance amongst 
the response terms) magnifies experimental error greatly. By “high covariance”, we mean the inner 
product amongst the response terms in the constitutive law is nearly equal to the maximum possible 
value - i.e., the product of their magnitudes. Moreover, we show that the second partials of W 
with respect to I\ and I 2 should approach infinity as the strain decreases. Using an alternate set 
of invariants with minimal covariance (i.e., a null inner product amongst the response terms), a W 
for rubber can be determined forthwith. 

Mathematics Subject Classifications (2000): 74-05, 74B20. 

Key words: finite elasticity, elastomer, biaxial testing. 



This work is dedicated to the memory of Clifford C. Truesdell whom I have only 
known through his writings. As evident by his publications and our own. 
Professor Truesdell was a man of great reason and our rational investigations of 
mechanics have benefited immensely from his devotion. 



1. Introduction 

By defining strain energy functions of the form W(I \, I 2 ) where I \ and I 2 are 
respectively the first and the second principal invariants of C, the right Cauchy- 
Green deformation tensor, it is possible to find exact solutions to some nonlinear 
boundary and initial value problems in mechanics of incompressible materials with 
behavior that is isotropic and hyperelastic. This approach was pioneered by Rivlin 
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(e.g., [1]). Moreover, many of these boundary value problems can be solved ana- 
lytically to provide universal solutions - solutions that arc valid regardless of the 
specific form of W. 

Nevertheless, to use finite elasticity theory in practice or to verify results in 
solid-state physical chemistry (e.g., statistical mechanics of polymer chains), it 
is necessary to determine a W for a real material. Experimental finite elasticity,* 
however, is in its infancy and is in need of much attention. In paid, this is due to the 
complexity of the task. Yet, even for the most basic case, the results of experiments 
have been inconclusive and often contentious. 

Specifically, let us consider the biaxial stretching of a rubber sheet - a statically 
determinate test on an isotropic elastomer with minimal hysteresis and with nearly 
incompressible behavior. Here, the response functions (i.e., dW/dli and dW/d I 2 ) 
can be calculated directly from the biaxial stretch data. Because of a magnification 
of experimental error in much of the deformation range of rubber, however, the 
functional form of W{I\, 7 2 ) remains elusive in the sense that experiments cannot 
determine it in a definitive manner. 

Ambiguity in determining W{I\, 7 2 ) is, incidentally, used by Rivlin as a justi- 
fication for not reporting a W in his seminal paper with Saunders [2], Rivlin and 
Sawyers [3] state: “. . . was not explicitly presented in the paper, since it was felt 
that other expressions for W could fit the experimental results equally well and it 
seemed invidious to select this one for special mention.” Some of the various forms 
of W(1 1 , h ) for elastomers in the literature [2, 4-9] arc displayed in Table I. 

Additionally, one cannot solve for the I\ and 7 2 response functions for uniaxial 
tests (i.e., uniaxial stretch and equibiaxial stretch) unless one assumes a functional 
form for W a priori. Whereby, calculations of dW/dli and dW/dI 2 for uniaxial 
tests arc only as valid as that which is assumed for the functional form of W (7i , 7 2 ) 
in the first place. As a notorious case in point (see the discussion in [10]), a Mooney 
plot only yields C 10 and C 01 if indeed the test piece behaves like a Mooney material 
(entry 1 in Table I). 

Herein, we show that this indeterminacy for uniaxial tests and this magnifica- 
tion of experimental error in determining W (7) , I 2 ) is due to significant covari- 
ance amongst the response terms in the constitutive law for Cauchy stress or true 
stress t. All constitutive theories with such covariance, moreover, will magnify the 
experimental error that is inherent in tests on real materials. 

To understand why, let us first define the covariance amongst tensors in an 
explicit fashion. Toward this end, let the covariance ratio between second order 
tensors Aj and A 2 be defined as 



R c (Ai,A 2 ) 



abs(Ai : A 2 ) 
I A ! 1 1 A 2 1 



( 1 . 1 ) 



* The experimental determination of constitutive laws for elastic solids that undergo large 
deformations. 
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Table I. 



W = Cio(/ 1 -3) + Co,(/ 2 -3) 


Mooney [4] 


(1940) 


IV = Cio(/i - 3) + Coi(/ 2 - 3) + C 02 (/ 2 - 3) 2 


Rivlin et al. [2] 


(1951) 


W = Cio(/i-3) + C 01 ln(!) 


Gent et al. [5] 


(1958) 


W = C l0 f exp(ci(/i - 3) 2 ) d/i + C 0 i ln(f ) 


Hart-Smith [6] 


(1967) 


IV = CioUi - 3) + Cio(/ 2 - 3) + C 02 In(^ta) 


Alexander [7] 


(1968) 


W = c l0 f exp(ci(/i - 3) 2 ) d/i + Coi ln(f ) 


Alexander [7] 


(1968) 


IV = (flo(7i - 3) - ai (It - 3U 1 + ja 2 (I i - 3 )“ 3/2 (/ 2 - 3) 
+ Ao(I 2 -3)-A 1 ln(I 2 -3)) 


Obata et al. [8] 


(1970) 


W = (C todi - 3) + C 20 (/i - 3) 2 + C 30 (/i - 3) 3 


James et al. [9] 


(1975) 


+ C 01 (/ 2 -3) + C„(/i — 3)(/ 2 - 3)) 



The Cjj, Cj, Aj, and <7/ are all constants. Most of the constants have different symbols in the 
corresponding reference. Obata et al. [8] notation is preserved because W was not given and it 
was necessary to integrate their response functions [8, equation 15]. Rivlin and Saunders [2] did not 
report the IV corresponding to that above, yet they draw such a IV response in their response function 
plots. 

where : A 2 = tr(A]A 2 ) is the inner product of A | and A 2 , and |Ai|, for ex- 
ample, is the magnitude of A | or VAi : A|. It follows (from the Cauchy-Schwarz 
inequality) that: 

(1) * c (Ai,A 2 ) g [0, 1]; 

(2) Rc(A[, A 2 ) = 1 iff Ai and A 2 are colinear;* and 

(3) /?c(Ai, A 2 ) = 0 iff Ai and A 2 are mutually orthogonal.** 

Hence, the covariance is high if Rq(A i ■ A 2 ) is near 1 and low if near 0. 

Let us consider a constitutive law of the form t = — c/I+cti A | +a 2 A 2 wherein q 
is an indeterminate pressure, c/\ and a 2 are scalar response functions and Ai and A 2 
are symmetric, deviatoric and kinematic tensors. Upon separately contracting Ai 
and A 2 onto t, we obtain two equations that can be solved for response functions oq 
and a 2 in terms of stress and strain measurements. As shown in the Appendix, error 
in stress measurements will propagate through response function calculations, and 
the error will be magnified by the factor: (1 — R c ( At, A 2 ) 2 ) -1 / 2 . Hence, if the co- 
variance is high then error will be magnified greatly (with an infinite magnification 
of error if R c ( Ai, A 2 ) = 1). 

For an incompressible material with W = W ( / 1 , / 2 ), the constitutive law for t 
is expressed by Rivlin’s representation formula which is 



* There exists two nonzero scalars a\ and a 2 suc h that a \ Ai + a 2 A 2 is the null tensor. 

** It is understood that Ai and A 2 are mutually orthogonal when their inner product Aj : A 2 
vanishes. Terminology such as “Aj and A 2 are orthogonal” has to be avoided because it is conven- 
tional to say “Ai and A 2 are orthogonal tensors” when A^ = A^ 1 and A^ = A)" 1 rather than when 
Ai : A 2 = 0. 






200 



J.C. CRISCIONE 




Figure 1 . Covariance ratio Rq of the I\ and I2 response terms for the biaxial stretch of an incom- 
pressible sheet. Aj and X 2 316 the in-plane stretch ratios and the data points shown correspond to the 
biaxial stretch tests of Rivlin and Saunders [2]. The light blue and the black data are respectively 
the constant I2 and constant l\ testing protocols. Rq is maximal and 1 iff the response terms are 
colinear. Rq is zero iff the response terms are mutually orthogonal. For much of the stretch domain 
of rubber, covariance amongst response terms is significant in the sense that Rq is close to unity. 



t = —pi + 2 B — 2 B 

dh 3/2 



-1 



( 1 . 2 ) 



where B is the left Cauchy-Green deformation tensor. It follows from the results 
in the Appendix that error in the response function calculations will be magnified 
by the factor: (1 — f? c (dev(B), dev(B -1 )) 2 ) -1 / 2 wherein dev(B), for example, de- 
notes the deviatoric part of B. Figure 1 displays 7? c (dev(B), dev(B -1 )) for biaxial 
stretching tests. Note that it is nearly 1 for moderate strain and it is equal to 1 for 
uniaxial tests. The magnification of error is large and even infinite in much of the 
strain domain of rubber. 

In Section 2, we provide a detailed analysis of the experimental error in as- 
suming that rubber is elastic, and in Section 3, we develop a method of analysis 
that rationally adjusts for this error. In this rational experimental method, we fit 
the data with a stress t w (k l ,X 2 ) that is continuous and hyperelastic like (i.e., 
t w satisfies the necessary conditions for an isotropic hyperelastic material being 
stretched biaxially). To facilitate direct comparison, we use the same to generate 
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response function plots for Rivlin’s representation formula and for a novel repre- 
sentation formula [11]. In so doing, we show that the experimental determination 
of W ( / 1 , / 2 ) from biaxial stretch tests is ill-conceived. The representation formula 
of Criscione et al. [11], on the other hand, is well-posed. This later formula has 
minimal covariance (i.e., null inner products amongst the response terms), and the 
form of W for rubber can be determined forthwith from biaxial stretch tests. 

Furthermore, we show that the second partials of W with respect to I\ and/or 
/ 2 should approach infinity as the strain vanishes. Following Mooney [4], most 
elasticians assume the complete opposite - i.e., they consider rubber to be such that 
d 2 W /d I 2 , 3 2 VT/3/ 2 , and d 2 W /dl\d It vanish. As a simple example of a IT with 
singular second partials, consider W = //|E[ 2 + y |E | 3 where |E| is the magnitude 
of the Green strain tensor and fi and y are positive constants. Such a material 
law has smooth, monotonic behavior that recapitulates 1 i near elasticity when |E| is 
small. Nevertheless, it is easy to show that d 2 W/dlf, d 2 W/dI 2 , and <) 2 W /<) I\ 3 / 2 
go to infinity as |E| vanishes. One cannot rule out the existence of cubic depen- 
dence on strain magnitude in W. Based on the analysis herein (Section 4) and the 
experiments* of Obata et al. [8], one should, in fact, expect singular second partials 
for W(I\, L) when I\ and / 2 approach 3. 

2, Experimental Error in Biaxial Tests on Rubber 

Ideally, a test specimen that is composed of an elastic solid should behave such that 
there is one stress response for each state of deformation. In practice, however, and 
even with measurement error withstanding, stress measurements display** varia- 
tion when one particular' configuration is retested multiple times. If one is inves- 
tigating the inelastic behavior of metals, polymers, etc., then this departure from 
elasticity is of primary interest. Nevertheless, many solids have a range of deforma- 
tion wherein their behavior is predominately hyperelastic in the sense that the work 
done on the specimen is mostly recoverable. For such materials, a hyperelasticity 
framework may be useful for predicting material behavior and for investigating the 
physical origin of the mechanical behavior. 

Yet, if an elasticity law is to be determined from tests on rubber, then it is im- 
perative that departures from an elastic ideal be considered as experimental error. 
A direct relationship between irreproducibility of stress data and experimental error 
is undeniable when one assumes there to be only one stress state for every strain 
state. This fact is neglected, however, in almost all experimental reports on rubber. 
To the knowledge of this author, only Treloar [10] and Jones and Treloar [12] 
address this type of experimental error. 

This type of error is an “error of definition” (e.g., see Beers [13]) as opposed 
to an “error of measurement”. It is error nevertheless, and like measurement er- 
ror, it represents an amount by which we are uncertain of the elastic stresses in 

* In particular, note that the slope of dW /dl\ vs. / 2 becomes steeper as I\ and It approach 3. 

** Provided that the force transducers have a good enough resolution. 
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rubber. Rather than simply report the error as the resolution of the transducer, an 
experimentalist should retest the same values of A| and X 2 in a multitude of ways 
(e.g., loading and unloading). The variance amongst the stress measurements is the 
square of the error in assuming that the true stress is the average of these stress 
measurements at this particular - strain. 

Retests of data points are rarely (if ever) reported in the rubber literature for 
error analysis purposes; however, such retests often arise unintentionally. For ex- 
ample, consider the biaxial stretching data of Rivlin and Saunders [2] wherein* 
there are the data Ai = 2.3, A 2 = 1.91, t\ = 21.5 kg/cm 2 , and t 2 = 16.5 kg/cm 2 
in Table I (constant 7) protocols), but in Table II (constant I 2 protocols) they report 
A t = 2.3, A 2 = 1.92, t] = 21.6 kg/cm 2 , and t 2 = 16.0 kg/cm 2 . These two data 
points have the same A ] , but A 2 of the latter is greater. One would expect t 2 of the 
latter to be greater as well, yet in fact, it is lesser. This difference (0.5 kg/cm 2 ) 
cannot be attributed to measurement error (less than 0. 1 kg/cm 2 ). Hence, one must 
suspect that the experimental error associated with Rivlin and Saunders’ data is 
predominately that which is due to the inelasticity of their specimen rather than 
that which is due to the resolution of their transducers (calibrated helical springs). 

From the hysteresis loop reported by Rivlin and Saunders it should be evident 
that their rubber specimen has a small amount of inelasticity. As a fair quantifica- 
tion of their experimental error, let us assume that the loading curve and unloading 
curve are bounds on the stress variance in such a manner that these curves are 
one standard deviation on either side of an expected mean curve. Whereby, half 
of the difference of the loading and unloading stresses vs. stretch would be the 
experimental error in uniaxial stress vs. stretch. Upon noticing that the hysteresis 
loop is wider at larger strains, a first approximation may be that the experimental 
error is proportional to the stress measurement. In particular - , we estimate that the 
error in assuming elasticity is 2% of the stress measurement. 

A detailed error analysis for tests on rubber remains wanting. However, in lieu 
of such, all error bars in the figures herein are calculated assuming that the error in 
knowing an elastic stress value (at a particular - configuration) is 2% of the measured 
stress value. Rivlin and Saunders tried to minimize the effects of hysteresis; yet for 
the two overlapping data points mentioned above, the t 2 data differ more than 2% 
from their mean. Also recall that the difference is in the wrong direction. 

In order to estimate the propagation of error in the response function calcula- 
tions for each data point in the biaxial stretch tests of Rivlin and Saunders [2], 
use their equations for calculating the response functions with rearrangement as 
follows: 

dW k 2 k 2 , 

— = » « » ~ — —h ~ » — » ^ — —t 2 , (2.1a) 

dh 2(A 2 - A 2 ) (A 2 - X~ 2 X~ 2 ) 2 (A 2 - A;)(A 2 - X\ 2 X^ 2 ) 

* To be fair, there are two other overlapping data sets in these data tables, yet the set with the 
greatest variance is given here. To justify so doing, note that the variance displayed by two data points 
will be typically less than the full variance displayed by multiple data since it is relatively rare to have 
both points significantly deviate to opposite sides of a true mean - i.e., that obtained from many data. 
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dw _ 1 i 

dl 2 ~ 2(Xj - X 2 )(X 2 - X\ 2 X~ 2 ) h 2(X\ - X 2 )(X 2 - X\ 2 X~ 2 y' ( ' 

where t\ and t 2 arc the in-plane principal stresses that correspond respectively 
with the in-plane principal stretches Aj and X 2 . 

Assuming the error in t (at particular' values of A! and X 2 ) is 2% of the measured 
stress, the error in t\, for example, is At] = ±2^/100. The errors At\ and A t 2 are 
unrelated, and as such, the error propagates as the square-root of the sum of each 
term squared, 

/ f_i_ 

\(Aj - X 2 )(X 2 - X~ 2 X~ 2 ) 100 

, ( ^2 h_ 

V (A i - X 2 )(X 2 - X\ 2 X~ 2 ) 100 

Error propagation in the I 2 response function calculation is obtained in a similar 
manner. 
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Figure 2. I\ and I 2 response function plots of the biaxial stretch data of Rivlin and Saun- 
ders [2] with error bars obtained by assuming that the experimental error in the stress data is 
2% of the stress measurement. There are more points in this plot than in Figure 6 of Rivlin 
and Saunders because all of the data in their Tables I and II are plotted here. The symbols 
con'espond to those used by Rivlin and Saunders and they also correspond to those in Figure 1 
herein. The magnification of error is greatest near equibiaxial stretch where, as evident in 
Figure 1 , the covariance is greatest. 
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Figure 2 displays the response function plots of the biaxial stretch data in [2] 
with error bars included. The error bars arc, in general, small for the data in the 
vicinity of pure shear. Not surprisingly, the data near pure shear arc in regions with 
the least amount of covariance (see Figure 1). Data in the red zone of Figure 1 
have substantial error bars - even at high values of I \ and / 2 . Moreover, Rivlin and 
Saunders intentionally did not test regions with a higher magnification of experi- 
mental error. In the moderate strain range (i.e., extensions below 100%), error bars 
would be much larger and approaching infinity as the strain decreases. 



3. Rational Experimental Method 

As shown in the prior section, biaxial stretch data on rubber specimens will display 
departures from the elastic ideal. So too will the data depart from isotropic and 
hyperelastic ideals. To address this error in a rational fashion, we assume that a 
stress measurement t /W at the strain state given by A] and A 2 is 

t*/ = W + At, (3.1) 

where t w is a continuous function of the strain that satisfies the assumptions of 
isotropy and hyperelasticity. The error At represents the amount by which mea- 
surements depart from the isotropic, hyperelastic ideal. 

For a biaxial test with a specimen being stretched A] and A 2 in the associated 
directions ei and e 2 , let t^ be written as 



t w — 7. 2 )ei ® e i + twi (A. t , A 2 )e 2 <8> e 2 , (3.2) 

where t w i and t W2 are scalar functions of A] and A 2 and the sheet surface (with 
normal e 2 ) is traction free. In order for the stress response to be hyperelastic and 
isotropic with respect to the reference configuration, the following constraints 
should be evident 



%i(Ai, A 2 ) — % 2 (A 2 , Ai), 


(3.3a) 


tw 2(^1 > 1/2 ) = 0 


VAj ^ 1, 


(3.3b) 


?wi(A 2 // ”, A 2 ) = 0 


VA 2 ^ 1, 


(3.3c) 


3 h\t\y\ 3A-2^2 




(3.3d) 


9A 2 9Aj 





We sought a continuous, best fit for by defining 9 nodes and 4 elements 
in the (Ai, A 2 ) plane. The nodes have (Ai, A 2 ) values as follows: (1, 1); (1, 1.75); 
(1,2.5); (1.75,1); (1.75,1.75); (1.75,2.5); (2.5,1); (2.5,1.75); and (2.5, 2.5). 
The domains of the 4 elements are: (1) A | < 1.75 and A 2 < 1.75; (2) Ai ^ 1.75 
and A 2 < 1.75; (3) Aj < 1.75 and A 2 ^ 1.75; and (4) Aj ^ 1.75 and A 2 ^ 
1.75. Constraint (3.3d) allows the definition of a potential co( Ai, A 2 ) with A | t w | = 
dco/dXi and A 2 tw2 = da>/dX 2 . Hence, to find a best-fit for t w we sought a co with 
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bicubic Hermite interpolation within each element (C 1 smoothness across element 
boundaries). 

Let there be n measurements of t\, t 2 , Ai, and A 2 . With i = 1, 2, . . . , n; let 
and ty \ be the stress measurements of t\ and t 2 at stretches A 1 ,' 1 and A 2 i] . 
To detemiine the nodal degrees of freedom, we minimized the following error 
function: 



E 



iMJd 
A i 'mi 



3 co 
3Aj 



Ai=A [ / ] 

X 2 =^ 



+ 



iMJd 
a 2 1 M2 



3 co 
3A 2 



A 1= A [ / ] 

A 2 =4'' ! 



(3.4) 



subject to nodal constraints required by (3.3a). Since (3.3a) and (3.4) only constrain 
derivatives of co we enforced <y(l, 1) = 0 in order to obtain a solution. In total, 
there arc 19 independent degrees of freedom to be determined. Conditions (3.3b), 
(3.3c) were not enforced during the fit (except at Ai = A 2 = 1), yet the degrees of 
freedom determined by minimizing (3.4) were such that (3.3b), (3.3c) were nearly 
but not exactly satisfied. Let t^\ be our initial data fit that does not satisfy (3.3c). 
To satisfy (3.3c) exactly, we added the following function to f^ x (Ai, A 2 ): 



4Ti (Ai, A 2 ) 



(Ai - l ) 2 



(A7 1/2 



1) : 



1 (^2 ^ > ^ 2 ) 



Ai ^ L 

Ai < 1. 



(3.5) 



In this work, we do not consider deformations with in-plane compression (i.e., 

— 1/2 

A| < A 2 when A 2 1), and hence the denominator of the fraction in (3.5) is 
greater than or equal to the numerator. Since (j = (A/ l/2 , A 2 ) when A 2 > 0 

and Ai = A/ ^ 2 , the augmented data adjustment, t Wi (= t^ V[ + (j) will vanish and 
thus satisfy (3.3c). A likewise method (yet with Ai and A 2 interchanged) was used 
to enforce (3.3b). 

Although not continuous at (1, 1) in general, (j is continuous in this case. To 
understand why, consider a deformation with Ai = 1 — as and A 2 = 1 + s in the 
vicinity of the reference configuration (i.e. e 1) with s positive.* Since we do 
not consider deformation with in-plane compression, a has its maximal value of 
1/2 when the deformation is uniaxial extension in the e 2 direction. When a ^ 0 
then Ai ^ 1 and = 0. When a e (0, 1/2], we have 



IC 1 I < |-« 2 A 2 (Va;+1) 2 4 1 | <2|^j| 



(3.6) 



for sufficiently small s > 0. The continuity of at (1, 1) follows from the smooth- 
ness of t^ vl and the fact that t^ vl vanishes at Ai = A 2 = 1. 

Rivlin and Saunders’ biaxial data is too sparse for our fitting method because 
data is lacking for small and moderate strains. Jones and Treloar’s data, as tabulated 
in [14], is better with n = 99 and uniform coverage of the (Ai, A 2 ) plane. Figure 3 

* Since we do not consider deformations with in-plane compression, negative e would necessitate 
Aj > 1 and fj = 0. 
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Figure 3. Fit of t w to biaxial stretch data in [14]. The data for both t\ and t 2 are on this 
plot with the A. x values shown directly and with the k 2 t 2 values shown at the point with Aj 
and A 2 transposed (see text). The fit is smooth, monotonic, and it satisfies the isotropy and 
hyperelasticity assumptions. A segment at each data point bridges the data and fit values. The 
two circles highlight regions (with transposed A) and A 2 ) where the data violate hyperelasticity 
and thus depart from the fit (see text). 



plots the data and our fit. Only the surface is shown because (3.3a) requires a 
mirror symmetry such that k 2 t W2 is obtained when k\ and k 2 are interchanged (i.e., 
reflection about a plane given by the condition A.i = k 2 ). The a,' data points are 
shown directly. The data points are displayed by interchanging k\ and k 2 . 

Note that our fit is smooth, monotonic and representative of the test data. There 
are systematic departures - two of which arc circled. Such systematic departures 
also occur when the number of degrees of freedom of the fit is increased (34 inde- 
pendent degrees of freedom obtained from 16 nodes and 9 elements) because these 
departures violate the hyperelasticity constraint. To see this, combine constraints 
(3.3a) and (3.3d) to obtain 



/ 9 

I — — k\twi(k\, A. 2 ) 
\9a 2 






/ 9 

I TT "k 1 f W 1 ( A- 1 , k 2 ) 
\dk 2 



Ai=A| ’ 
X 2 =X\ 



(3.7) 



where A.* and /,( arc arbitrary constants. As highlighted on the left side of Figure 3, 
the surface is flat such that the left side of (3.7) is near - zero. Yet in the region 
highlighted on the right (i.e., the region with A, 1 and k 2 transposed), the data have 
a nonzero slope. Our fit must depart from the data because it models the stress in a 
hyperelastic material - one that satisfies (3.7). 
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Figure 4. Biaxial stretching trajectories with I\ or h held constant at 3.5, 4, 5, 6, and 7. 
Also shown are the equibiax line and the uniaxial stretch curves. The trajectories with I\ 
constant have a radius of curvature (on the equibiax line) with a center toward the origin. The 
trajectories with h constant, on the other hand, have a radius of curvature (on the equibiax 
line) with a center away from the origin. 



4. W (I i , / 2 ) Has Singular Second Partials 

The prior section described how to obtain a stress field t w for rubber that is a 
continuous function of (Ai, A 2 ) and satisfies the assumptions of isotropy and hyper- 
elasticity. With this data adjustment, we can now attempt to find W(I \ , / 2 ). Toward 
this end, consider the ideal testing trajectories in Figure 4 which separately hold I\ 
or / 2 at the values: 3.5, 4, 5, 6, and 7. Note that all deformations with I\ ^ 7 have 7, i 
and A 2 within the domain of the fit displayed in Figure 3. Hence, subsequent plots 
with the 7j axis truncated at 7 arc within the stretch range used to determine tw - 
i.e., we arc interpolating, not extrapolating the Jones and Trcloar data in [14]. 

The response function plots in Figure 5 arc calculated from A;, A 2 , and tw(Ai, A 2 ) 
for the aforementioned I\ and / 2 trajectories. Note that highly nonlinear behavior is 
evident for small and moderate strain despite the fact that tw is smooth, monotonic. 
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and thus linearly elastic in the small strain limit. Singular second partials of W arc 
evident as I\ and / 2 approach 3. Since the error bounds arc large and even infinite 
in the red zone of Figure 1, W (I \ , / 2 ) cannot be found from response function plots 
- i.e., eixor obscures any fiends in the plot. 



5. Determining W(K 2l K 3 ) Is Well-Posed 

Magnification of error is not problematic for all phenomenological theories of rub- 
ber elasticity. In fact, Criscione et al. [11] developed an approach using natural 
strain In V which minimizes the covariance amongst response terms. For rubber- 
like materials, [11] reports* W = W(K 2 , K 2 ) where K 2 = |dev(lnV)| and K 2 = 
piud(lnV). Whereby, the constitutive law becomes 



dW dW 3 J l ~ K 3 

t = — ql H udev(lnV) cudev(lnV). (5.1) 

9 K 2 dK 3 K 2 



Although the operators udev(-), piud(-), and cudev(-) arc not used in [11], they arc 
defined in the Appendix. With simple substitution, this formulation is consistent. 

Note that K 2 is the magnitude of the distortion strain** dev(lnV), and note that 
the K 2 response term is colinear with the distortion strain. As for K 2 e [—1, 1], it 
is the mode-of-distortion (1 for uniaxial extension, 0 for pure shear, — 1 for uniaxial 
contraction), and its response term is orthogonal to the distortion strain. Although 
Ki appeal's in the denominator, an important result from [11] is that dW /dK 2 must 
vanish as order when K 2 goes to zero. 

To solve for the K 2 and K$ response functions, respectively contract udev(lnV) 
and cudev(lnV) onto (5.1) to obtain 



dW 

= udev(lnV) : t, 

dK 2 



dW K 2 cudev(ln V) 

dK * 3/l - K\ 



: t. 



(5.2a) 

(5.2b) 



The orthogonal nature of the response terms (i.e., udev(ln V) : I = 0, cudev(ln V) : 
1 = 0, udev(lnV) : cudev(lnV) = 0) makes isolation of the response functions 
easy. Moreover, |udev(lnV)| = 1 and |cudev(lnV)| = 1. 

With an approach similar to that in the Appendix, it follows from (5.2a) that 
error in calculating 8W/dK 2 is on the same order as the root-mean-squared error 
of the principal stresses. Throughout the entire deformation range of rubber, thus, 
dW/dK 2 can be evaluated without magnification of experimental error. 

* We are liberal with our use of W to generally represent strain energy functions, and W ( K 2 , K 2 ) 
is meant to imply that W can be expressed in terms of K 2 and K 2 . It does not indicate that W depends 
on K 2 and K 2 in exactly the same fashion that W (/j , / 2 ) depends on /j and I 2 . 

** dev(ln V) is referred to as distortion strain because the spherical part of InV solely depends on 
dilatation whereas its deviatoric part does not depend on dilatation whatsoever. 
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Figure 5. I\ and Ij response function plots for the trajectories in Figure 4 with the stress 
given by tw (i.e., the surface fit in Figure 3). Error bars are not shown, yet the error bounds 
are larger than those in Figure 2 and go to infinity near the ends of each curve. 

As discussed in [11], it is appropriate that the dW/dKj calculation be sensitive 
to error for uniaxial, axis-symmetric deformations (i.e., Kj = 1) because the 
response term must vanish in order to satisfy symmetry. In particular, note from 
(5.1) that dev(t) is colinear to dev(lnV) for uniaxial deformations - a necessary 
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Figure 6. Functional dependence of W on AT and K3. W and its derivatives are in units of 
MPa. The top panels show how 9W/3K2 (a) and W (b) depend on Kj. The width of the line 
spans the upper and lower error bounds. A functional form, given by W = g(K2) + ATA(AT), 
is appropriate. Fitting a line to the W vs. K 3 relation at multiple values of K2 yields the 
intercept g(AT) and slope /((AT) which are plotted in panels, respectively. As required by [ 11 ], 
note that g(K2') goes to zero as order K | whereas h(K2) vanishes faster. 



condition for an axis-symmetric deformation superimposed on an isotropic body. 
Because of this, the AT response function has an infinite magnification of error for 
uniaxial tests. 

Yet, it is a simple experimental task to measure how dW/dK 2 depends on AT 
because the K 2 response function is measurable, nevertheless, for uniaxial defor- 
mation and all neighboring configurations. In contrast, neither dW/dI\ nor dW/dli 
can be measured for uniaxial tests because their response terms arc colinear - the 
maximum of covariance. 

Using our data adjustment (i.e., t w in Section 3) to calculate 9W/9AT, Fig- 
ure 6(a) plots dW/dK 2 as a function of K ; when K 2 is held constant at 0.25, 
0.5, 0.75, 1.0, and 1.25. The thickness of each curve is determined by the error 
propagation such that the upper and lower edges are maximal and minimal bounds, 
respectively. With tw being continuous, we numerically integrate 3W/dK 2 (with 
d K 2 = 0.01 and using the trapezoidal rule) at fixed values of AT. In so doing for 
many AT values. Figure 6(b) plots W as a function of AT when K 2 is held constant. 
Note the functional form of W is nearly W = g(K 2 ) + AT/TAT), i.e., linear in AT 
as suggested by [11], When K 2 is held constant, a I i near regression of W vs. AT 
has an intercept equal to g(K 2 ) and a slope equal to h(K 2 ). With such a regression 
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method, we plot g(K 2 ) and h(K 2 ) in Figures 6(c) and (d), respectively. Finding 
W( K 2 ■ K 2 ) is forthright and accurate. 



6. Conclusions 

It is shown that a representation formula for t in rubber is experimentally ill- 
conceived when there is significant covariance amongst the response terms. This 
is so because experimental error, inherent in tests on rubber, is unacceptably mag- 
nified when response functions arc calculated from biaxial test data. This result is 
important because it explains why the most widely used phenomenological theory 
for rubber elasticity (i.e., that of Rivlin) is the most intractable experimentally. In 
contrast, the phenomenological theories of Criscione et al. [11] and Ogden [15] 
contain response terms that are mutually orthogonal (the absolute minimum of 
covariance), and they arc experimentally tractable. Although not yet applied to 
data, the approach of Laine et al. [16] is sure to be experimentally tractable since 
the response terms arc mutually orthogonal. 

Furthermore, the analytical example in Section 1, the data in [8], and the results 
of Section 4 all show that W should have singular second partials with respect 
to 1 1 and l 2 . In contrast, the functional form of W(K 2 , K 2 ) is simple. As shown 
in Section 5, one should expect a W(K 2 , K 2 ) that is linear in K 2 and smooth 
and monotonic in K 2 . The behavior of the second partials is an important matter 
because they appeal - in the equilibrium equations. 

To their credit, Rivlin and Saunders [2] recognized that experimental error is 
magnified unacceptably for moderate strain. Subsequently, they restricted their 
biaxial tests to large strain with 7) and I 2 at 5 or above. By quantifying the mag- 
nification of error in terms of covariance, we show more precisely why W{I\, I 2 ) 
cannot be determined in the moderate strain domain and in some domains of high 
strain (i.e., the red region in Figure 1). Since high covariance is inherent in Rivlin’s 
representation formula itself, it is doubtful that tests such as torsion of a cylinder 
would be able to determine the functional form of W(I\, I 2 ). Yet, this is an open 
question - only biaxial stretching is considered in detail here. 

As for the statistical theory of rubber, many consider W (7i , I 2 ) to be useful or at 
least thermodynamically convenient (e.g., see Treloar’s book [10]). However, the 
covariance amongst the I \ and I 2 response terms is most pronounced in the moder- 
ate strain range where entropy terms (rather than internal energy or crystallization) 
are most likely to predominate. In other words, entropically based formulations 
of W are potentially valid for a strain range in which W(I\, I 2 ) cannot be found 
experimentally. This is a compelling problem for a statistical theory. How unique or 
useful is a IT (I \ , I 2 ) formulation that cannot be independently verified with biaxial 
testing? We conclude that some large variations in W(I\, I 2 ) will perturb the stress 
only minimally because we have shown the reverse to be true - i.e., small variations 
in stress values can give rise to large variations in W{I\, I 2 ). 
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Section 2 focuses on the inelastic behavior of rubber in order to accurately 
quantify the experimental error in assuming elastic behavior. Albeit important for 
assessing error, this inelastic behavior is small in comparison to the total stress 
response. It is worthwhile to note that the stress is predominately (say 98%) depen- 
dent on strain alone when the variation in stress for a given strain is 2%. 

We do not report a specific functional form of W(K 2 , K 2 ) for rubber herein 
because at present there is not an acceptable data set in the literature to so do. 
Although the data in [14] arc useful for showing that W(K 2 . K 3 ) is experimentally 
well-posed, there arc systematic departures in our fit for t w (Figure 3) that arise 
because the data violate hyperelasticity. One possible explanation for this is that 
the specimens may have been held (and stress relaxed) at the uniaxial stretch state 
before they were stretched in the cross direction. Moreover, isotropy cannot be 
verified because only half of the stretch domain is tested. More experimental work 
needs to be done. 

An ideal testing protocol would randomly test and retest a large set of stretch 
values that cover the biaxial stretch domain. Upon randomizing the stretches and 
stretch-rates during the test, the resulting best-fit t w (see Section 3) would be the 
best-guess for the stress if the strain-rate and deformation-history were not known. 
The standard deviation of the data from the tw fit would be the error in assuming 
that the stress in rubber is that which is given by a strain energy function alone. 
Regardless of what experimental method is used, a departure from an isotropic, 
hyperelastic ideal introduces error that must be addressed in a rational fashion when 
trying to determine W. 
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Appendix 

This Appendix shows that there is an inherent magnification of experimental error 
when there is significant covariance amongst response terms in t. Classically, t for 
incompressible materials with isotropic elastic behavior is written as: 

t = — pi + oqB — aqB -1 , (A.l) 

where p is an arbitrary scalar; aq and a 2 arc scalar - response functions, I is the iden- 
tity tensor, and B = FF T is the left Cauchy-Green deformation tensor where F is 
the local deformation gradient tensor. To develop general equations for calculation 
of ai and a 2 , let q — p — oq tr(B)/3 + a 2 tr(B -1 )/3. Whereby, 

t = — ql + aq dev(B) — a 2 dev(B _1 ), (A. 2) 



where dev(-) denotes the deviatoric part of the argument. 
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Two independent equations for ai and a 2 arc obtained by taking the inner prod- 
uct of (A.2) with dev(B) for one and with dev(B _1 ) for the other. In particular, 

t : dev(B) = a\ dev(B) : dev(B) — a 2 dev(B _1 ) : dev(B), (A.3a) 

t : dev(B _1 ) = ct\ dev(B) : devlB” 1 ) — a 2 dev(B _1 ) : dev(B _1 ). (A.3b) 



These equations arc easily solved, yet to express the solution in a more useful 
manner, let us introduce the operator udev(-) which denotes the unit deviator of 
its argument. The unit deviator of B, for example, is dev(B) divided by its mag- 
nitude, or equivalently, udev(B) = | dev(B)| _1 dev(B). Upon solving (A.3) and 
rearranging terms, we obtain 



udev(B) — (udev(B) : udevlB” 1 )) udev(B _1 ) 

| dev(B)|(l — i?c(dev(B), dev(B -1 )) 2 ) 
udev(B _1 ) — (udev(B) : udev(B -1 )) udev(B) 
| dev(B _1 )|(l — i? c (dev(B), dev(B -1 )) 2 ) 



(A.4a) 

(A.4b) 



Although udev(B) and udev(B ') must have unit magnitude, combinations 
of them do not. In fact, the numerators in (A.4a), (A.4b) have a magnitude of 
\j~\~— (udev(B) : udev(B -1 )) 2 . Note further that (udev(B) : udev(B -1 )) 2 is equal 
to /? c (dev(B), dev(B -1 )) 2 , whereby aq becomes 

udev(udev(B) — (udev(B) : udev(B -1 )) udev(B -1 )) 

«i = 1 ' — -p == : t. (A.5) 

| dev(B)|- v /l — i?c(dev(B), dev(B *))- 

Now the tensor in the numerator always has unit magnitude, and hence the sum of 
the squares of its principal values is unity (i.e., = 1 where §,■ arc the 

principal values of the tensor in the numerator). 

As necessary for isotropy, B and t arc coaxial. It follows that any combination 
of B and B _l is coaxial to t. Consequently, (A.5) becomes 

H\h + Hih + 

a i = , (A.6) 

| dev(B)| v /l — i? c (dev(B), dev(B -1 )) 2 

where the t, arc the corresponding principal values of t. Assuming that the error in 
knowing each of the principal values (i.e.. At] , A t 2 , and A r 3 ) of t is uncorrelated 
to the other principal values then the error bounds of cyi arc 



Aaq = ± 



y At I + $2 ^ 2 + $3 A t| 

| dev (B) | y/l — /? c (dev(B), dev(B -1 )) 2 



(A.7) 



where Atj\ for example, is the variance in t\. If the t, have a similar variance (i.e., 
A t\ = At 2 = At 2 = A 1q), then the vaiiance of a\ (i.e., Aoq) is 



Aaj 





| dev ( B ) | 2 ( 1 — /? c (dev(B), dev(B -1 )) 2 ) 



(A. 8) 
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It should be evident that error in t will be greatly magnified when A’ ( (dcv(B), 
dev(B -1 )) is near 1. 

To generalize for all isotropic, incompressible, elastic materials, note that one 
does not have to use B and B _1 as the kinematic tensors in t. In particular, we may 
write 



t — — <j I + crjAi + o' 2 A 2 , (A. 9) 

wherein a 1 and a 2 are scalar response functions and Ai and A 2 arc deviatoric 
tensors that arc linearly independent combinations of dev(B) and dev(B _1 ). With 
an approach similar to the above, it follows that the error in the calculation of a\ 
and ot 2 will grow as R c ( Ai, A 2 ) approaches unity. 

In order to avoid magnification of experimental error, hence, an ideal choice 
for Ai and A 2 would be combinations of dev(B) and dev(B _1 ) that arc mutually 
orthogonal so that Rq(A\, A 2 ) vanishes. Moreover, if these combinations were 
normalized such that | A 1 1 = [A 2 | = 1 then contraction of A 1 and A 2 onto (A.9) 
would separately yield 

a.\ — t : Ai, a 2 = t : A 2 . (A.10) 

In so doing, the error in the response function calculations would be on the same 
order as that of At itself. The response functions and the stress would have the 
same units and the same variance. 

REMARK. Such an orthogonal tensor basis with normalized magnitudes can be 
defined rather easily (see [17] for verification of all statements in this remark). 
Toward this end, let D be a linear combination of B and B 1 . udev(D) will be a 
combination of dev(B) and dev(B -1 ) that has unit magnitude. As for a tensor that 
is orthogonal to udev(D), use the complementary unit deviator of D, cudev(D), as 
given by 

! /tan + 3 piud(D) udev(D) - 3^6 udev(D) 2 

cudev(D) = , (A. 11) 

3-y/l — piud(D) 2 

where piud(D) = 3\/6 dct(udcv(D)) is the principal invariant of the unit deviator 
of D. Since tr(udev(D)) is zero and tr(udev(D) 2 ) is unity, udev(D) only has one 
principal invariant, det(udev(D)), which happens to be bounded by ±(3\/6)~' . 
Hence, piud(D) e [—1, 1], and it is such that piud(D) = —1 iff D is like the 
strain of uniaxial contraction* and piud(D) = 1 iff D is like the strain of uniaxial 
extension.** With generous use of the Cayley-Hamilton equation for udev(D), it 
is possible to show that cudev(D) : cudev(D) = 1 and cudev(D) : udev(D) = 0. 
Being deviatoric, it should be evident that udev(D) : I = 0 and cudev(D) : I = 0. 

* For uniaxial, isochoric contraction, strain tensors have one negative principal value and two 
positive ones that are equal. 

** For uniaxial, isochoric extension, strain tensors have one positive principal value and two 
negative ones that are equal. 
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Abstract. We introduce a suitable notion of generalized Hessian and show that it can be used 
to construct approximations by means of piecewise linear functions to the solutions of variational 
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1. Introduction 

The approximation of second, or higher order, variational problems by standard 
conforming methods is quite cumbersome, since it requires the continuity of the 
first, or higher order, derivatives across the mesh elements. It is therefore preferable 
to use alternative methods that with various strategies provide external approxima- 
tions to the solution, that is approximations within spaces of functions that are less 
regular than what would be required by the variational problem. Several families 
of these methods have been thoroughly studied in the past decades and a well 
established theory has been developed. 

In non-conforming finite elements the regularity requirements across the mesh 
elements are relaxed and an external approximation is obtained by simply consid- 
ering bases of non-conforming functions. The possible jumps and discontinuities 
at element interfaces resulting from nonconformity are completely ignored and 
just the sum of the contribution over the mesh elements is taken into account. 
The convergence to the solution is then assured for the so called consistent fi- 
nite elements [7], The mixed methods instead provide external approximations 
by enlarging the list of primal variables and adding suitable constraints as side 
conditions. By the introduction of Lagrange multipliers the minimum problem is 
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then changed into a saddle point problem. Obviously to solve the latter problem a 
peculiar choice of algorithms is needed [4, 17]. 

Here we follow a different approach that does not make use of Lagrange multi- 
pliers nor ignores the discontinuities across the mesh elements. We consider vari- 
ational problems of second order in a two-dimensional bounded domain, and give 
an approximation scheme using spaces of piecewise I i near functions defined for a 
chosen sequence of triangulations of the domain. Precisely, we introduce a notion 
of generalized Hessian, based on a discrete Green’s formula, and show that it en- 
dows the union of these discrete spaces with a sequential topology that makes it 
dense, in an appropriate sense, in the function space in which the given variational 
problem is defined. This is established under a mild assumption on the triangula- 
tions. Then, for a generic functional of integral type whose integrand is convex with 
respect to the Hessian and satisfies a standard quadratic growth condition, we con- 
struct a sequence of functionals defined on these discrete spaces and prove that it 
T -converges to the given functional. Moreover, when the integrand is strictly con- 
vex, the minimizers converge to the minimizer of the original problem. So this 
approach provides an approximation technique. 

All this generalizes ideas discussed by Davini [10, 11] and Davini and Pitacco 
[12, 13]. Credit must also be given to an early paper by Glowinski [15] that proba- 
bly did not receive the attention it deserves. 

Applications and the crucial issue of estimating the convergence rate arc not 
considered in the present paper (see however the related paper by Davini and 
Pitacco [13] where the rate of convergence for the biharmonic problem is studied 
within the general framework of the mixed method). Our attention is rather focused 
on general aspects of the method and, particularly, on its connections with the 
T -convergence of functionals. Although it is customarily used for different scopes, 
it seems to us that the framework of T -convergence lends itself quite naturally to 
approximation purposes. In particular, as in this case, it may lead in a direct way 
to the introduction of sequences of unconstrained minimum problems for func- 
tionals defined in non-conforming spaces and whose minimizers provide external 
approximations to the solution. 

Our results are not confined to the quadratic functionals and cover a fairly broad 
class of problems. It is worth recalling that various authors have proposed for 
the quadratic case approximation techniques that turn out to be si mi lar to ours, 
although they have been worked out in a different perspective. We mention, among 
others, the works by Bhattacharyya et al. [3, 16], who treated the case of linear 
anisotropic plates within the scheme of the mixed methods, and those by Angelillo 
et al. [1, 2], who adapted the argument of Davini and Pitacco [13] for the loading 
problem of plane anisotropic linear elasticity. 

2. Discretization of the Domain 

Let G c M 2 be an open bounded domain with smooth boundary. Let 7), := 
{T/}j = i Ph , with h taking values in some countable set W of real numbers, be 
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a sequence of triangulations of 12 regular in the sense of Ciarlet [6], i.e., such that 
the ratio between 

pi, = inf sup{diam (S): 5 is a disk contained in T , } and 

i s 

h := sup{diam Z)} 

j 

is bounded away from zero by a constant independent of h. We denote by x, the 
vertices of the triangles Tj and call them the nodes of the mesh . We indicate by 

Ph := {1,2 P h ] and Nf, := {1,2 , . . . , N/,} the sets of values taken by the 

indexes of the triangles and the mesh nodes, respectively. We shall call 7/, the 
primal mesh. Denoting 



<2, := (J Tj 

Tj&Th 

we require that 12/, invades 12 from inside. 

Following Davini and Pitacco [12, 13], for each h e W we also introduce a 
dual mesh T], := {7] };=i,...,n a consisting of disjoint open polygonal domains, each 
containing just one primal node, as shown in Figure 1 where the dual elements 
are drawn with dashed lines. We assume that the sequence of dual meshes is also 
regular and that 



= U Tj. 

TjeT h 

Let X i, be the space of functions which are affine on Tj and continuous on Q lt 
(briefly, the polyhedral functions over T/,), and let X 0h c X h denote the set of 
functions that vanish on 312/,. We regal'd X 0h as a subspace of //,] ( 12) by extending 
the functions to zero in 12 \ Q h . 
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Let i fi be the polyhedral splines in X h defined by the condition that (fit (xj ) = <5, ; - 
for i, j = 1, . . . , /V/, . We assume that 

\ T i\ — J <Pi dx (= ^|supp(<p,)|^ ( 1 ) 

and call it ASSUMPTION (HO). Note that it is always possible to construct a mesh 
with this property, e.g., by taking the nodes of the dual mesh to be the center of 
mass and the middle points of the sides of the triangles of the primal mesh. 

In what follows, in order to keep the notation simple, we sometime avoid la- 
beling by h certain quantities that are mesh dependent, such as, for instance, the 
mesh nodes or the mesh elements, assuming that that dependence is clear from the 
context. Also, it shall be useful to distinguish between the internal nodes, which arc 
those that do not belong to 3S2/, and whose indices take value in the set T/, c A/,, 
and the boundary nodes which arc those sitting on it. 



DEFINITION 1. We shall say that the sequence of partitions considered has the 
Property (★) if the fourth-order tensor 






Vtfij <g> V(f) k dx j <g> [(x 7 - - x k ) <g> (Xj - x k )] 



satisfies the following inequality 

^ 1, 



lim sup sup | 

h ke!th 



Ak) 

F /i 



where || • [| denotes the sup-norm, i.e., 



Id := sup h 

H^O 



iChi 



HI 



H ranging over the space of second order tensors. 



A similar condition was required by Glowinski [15]. Angelillo et al. [1, 2] 
considered instead meshes with the following property (Property (AFF)): 

N h r 

/ H(x — Xj) ■ (x — Xj)Xcpj <g> V(p k dx = 0 VH 
.•_! dQ/ ; 

for every node x k . In what follows we prove a couple of lemmas implying that if the 
mesh satisfies Property (AFF) at the internal nodes, then it has the Property (★). 
As we shall see our assumption that 12/, invades D is sufficient to control also the 
contribution coming from the boundary nodes. 

REMARK 1. The fourth-order tensor G ( /: k> maps second-order tensors into sym- 
metric tensors. To deduce this it suffices to show that f Q Vipj <g> V<p k dx is a sym- 
metric second-order tensor. Note that if xj and x k are not the nodes of the same 
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Figure 2. Two triangles of the primal mesh. 



triangle then the integral considered is equal to zero. So, let us fix attention to any 
node Xk and adopt local labels t — 1, 2, . . . , t k to denote the nodes all around it 
and the respective primal triangles. Also, choose counterclockwise ordering and 
indicate by n ; the unit normals to the sides joining the central node x k to x\ k) . 
Let T r ikl and T t (k \ be the triangles having the side (x/., x ( r k> ), with xj = x\ k) , in 
common. The situation is represented in Figure 2. After denoting with j and 
(p\ k] the restriction of (pj on T t (k \ and T, (k \ respectively, we may write 

/ Vtpj <8> Vfo dx = / (8) Vfa dx + / Vq> ( t k) ® Vfa dx, 

Jn h JT t ( *\ Jt* ) 

from which, by applying Green formula, it follows that 

/ V0; 0 V&dx 

J Qj, 

= f , V (p (k) <g> n r+ i^.dr 
* x k ’ x( ,+ 1 

- f [|V*f|] ® n r &dr - f V0® ®n ; _! 0 k dr. 

Jxk.xf* Jxk,x ( ( _\ 

Now, the gradients V<p, (A) and have the direction of n /+] and n f _i, respec- 

tively. So, the integrands of the first and third integrals arc symmetric dyads. On 
the other hand, by Hadamard lemma, [|V<p, (A) |] = [ | (p\ k) n \ | n ; and the second integral 
also takes on symmetric values. 

The following simple lemma shall be useful in what follows. 

LEMMA 1 . Let x k be an internal node. Then, for all H, the identity 



2|7yfc| 



V , pi ® S7<p k dx 



holds true. 
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Proof. We notice that, if W is skew-symmetric, equation (2) is satisfied because 
G^’W = 0 by the definition. Hence, without loss in generality, let H be any 
symmetric second-order tensor. Then, from 

Nh -1 f 

Gj, A) H = y2 — H(x ; - - x k ) • (Xj - x k ) / Vfj ® V<p k dx, 
j=l 2\h\ J n 

and by taking into account that V<^/ = O' since <Pj = 1' and that 

x j < Pj(x) = x we find 




By a Green formula the last integral on the right hand side vanishes because (p k 
vanishes at the boundary of supp <p k when x k is an internal node. Thus, equation (2) 
holds true. □ 

The next lemma shows that Property (★) is more general than Property 
(AFF). 

LEMMA 2. An internal node x k owns the property (AFF) if and only if G^ ] = I, 
where I is the fourth -order identity tensor. 

Proof. Without loss in generality it suffices to consider a generic symmetric 
tensor H. Then, 




Hx • xj V fj ® V(p k dx 





Hx ® Vf> k dx 




(pk dx, 




V(Hx)<p,t dx 
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where we have applied Green’s formula and taken into account that x k is an internal 
node so that 0 k vanishes on the boundary of supp <p k . By ASSUMPTION (HO) we 
then obtain 

N h 

J2 I Ha x j V<pj 0 V& dx = -H| % \ . (3) 

j | 2 q / ; 

Moreover we have 




H(x — xj) ■ (x — xj)V(fij 0 V(pk dx 




N h { 

0 V(p k dx — 2 / Hx • XjV(pj 0 V^. dx 

j=\ “ 



+ 




Hx j ■ XjV<Pj 0 Vi p k dx 



and by Lemma 1, equation (3) and noticing again that <Pj = we obtain 



tv* „ 

y; / H(x — Xj) • (x — Xj)V(pj 0 V^. dx = 2|7*|(H — G®H). 

j=\ ” 

From this identity it then follows that 
Gf’H = H VH 

if and only if Property (AFF) applies. □ 



As a consequence of Lemma 2 we deduce that Property (★) holds whenever 
the chosen triangulation forms hexagons or half hexagons formed by triplets of 
equal isosceles triangles, see Figure 3, since it was proved by Angelillo et al. [1, 2] 
that in this case Property (AFF) holds. 

We now look at the case in which the primal mesh is generated by a rectangular 
grid of nodes, see Figure 4. In this case, after tedious calculations similar to those 
done in Remark 1, we deduce (with the notation shown in Figure 4) that G® = y I, 
where 

3(1 + af}) 

2 + 2a/3 -p a + 

and where a := a\/ai and := b\/b 2 - Hence Property (★) holds provided 
y ^ 1, that is for ({cr ^ 1} Pi {/l ^ 1}) U ({/l ^ 1} D {a ^ 1}), cf. Figure 4. Note 
that Property (AFF) holds only for either a = 1 or ft = 1. 
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Figure 3. Partition for which property PROPERTY (AFF) holds. 




Figure 4. a := m/ a 2 , /3 := fei/^2- 



3. Generalized Hessian 

In what follows we are interested in dealing with functions in X$ h and shall imag- 
ine them extended to zero in the whole of M 2 . Let v e X 0 /,. Then, its second 
distributional derivatives arc defined by (v, a p , i/ / ) = — /„, v, a xjj , p dx, for all 
functions x{/ e Cq°(R 2 ). Hence the Hessian, D 2 v, is a symmetric linear operator 
from C q° (M 2 ) into the space of two by two real matrices defined by 

(D 2 v,\lr) = -j Vv®VxJ/dx. 

By density we can extend this operator to the Sobolev space H l (£2/,). We shall still 
denote this extension by D 2 v. It follows that 

(D 2 v,i/) = -l V0<g>VVrd.r, &eX h . 

J 
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Noticing that every function i// g X h can be written as \ ]r{x) = x 

<Pj(x)), we can write 

Nh r N h 

(Z) 2 n,!A) = -£ / Vv ®V<pjdx^(xj) = ^tl h v(x j )\if(x j )\Tj\ (4) 

j= 1 7 = 1 



for every v G X 0/l and every i Jr G X/,, where we have set 

H/,0(x ; ) := [ Xv®X(pjdx. 

J \Tj\ Jn h J 

Given v G X 0 /, , we define the tensor valued function 



(5) 



N h 

H,,{’ := ^2u h v(xj)xf r (6) 

7 = 1 

where y?, denotes the characteristic functions of 7) . We will call H/, v the general- 
ized Hessian of v. 

Furthermore, given a continuous function / we define 



N h 

Chf(x) := ^2f(xj)xfj(x), 
7=1 
Nh 

nf(x ) := Y2f(xj)Wj(x). 
7 = 1 



(V) 



Obviously, if v e X/,, then r h v = v. For every v G X 0 /, and i } G X h , then, from 
equation (4) we deduce that 

[D 2 v, xjs) = f U h vc h x}dx. ( 8 ) 

For later use let us also denote by 



o * ^ 

H i,v := 2_^ H h v(Xj)xf. 
je$h 



(9) 



the simple function that coincides with the generalized Hessian of v on the inter- 
nal dual elements and vanishes outside. Note that, when x Jr G X 0h , equation (8) 
becomes 



(D 2 v, x A) = X! ^-hv(xj)xj/(xj)\Tj\ 

je$h 




Yli,v ci, x/r dx. 



( 10 ) 
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REMARK 2. From what we have shown in Remark 1 it follows that the gener- 
alized Hessian is a symmetric tensor valued function for every v e X 0h . Its trace 
turns out to coincide with the generalized Laplacian defined by Davini in [11]. 

It is worth noticing an interesting interpretation that can be given to the gener- 
alized Hessian when the boundary of 7) intersects at the midpoints the edges of 
the primal triangles that concur at x r From calculations similar to those made in 
Remark 1 we get in fact 



H/,D(x 7 ) = 



f 

1 tj 1 

— E 
i^i tr 



[|v,«|]n f 0 n r <p r (;) ds 



Y 2^ Xj ® n o 



(11) 



where [|0,„|] denotes the jump of the directional derivative of v in the direction 
normal to the primal side (xj, x 2j> ). It is easy to see that X h is contained in the 
space of functions with special bounded Hessian, SBH(£2), cf. [5], and that the 
distributional Hessian of v € X h is a Radon measure of the form 

D 2 v = [|u„|]n(8>nd3 ( f 1 L(5(Vi))), (12) 

where W 1 [(S(Vv)) denotes the one-dimensional Hausdorff measure restricted to 
the set where the gradient of v is discontinuous. By recalling that the boundary of 
Tj meets the sides of the primal mesh at the midpoints, we get 

D 2 v(Tj) = Y 2 1 (x J -*f 0) )|[lF«l] n r (13) 

t= l z 

when this measure is calculated on Tj. Therefore, by (11) 2 it follows that 

D 2 v(Tj ) 

H h v(xj) = — (14) 

which motivates us to regal'd the generalized Hessian of a polyhedral function at 
Xj as the mean value of the Hessian over Tj. 



4 . Some Properties of a Sequential Topology in U/ !e ac ^oh 

The set U /i69c 2f 0 / ; is obviously dense in // ( J (f2), with respect to the H 1 norm. 
The notion of generalized Hessian introduced in Section 3 can be used in order 
to endow the space X 0h with a sequential topology that makes it dense 

in H 2 (^l) H Hq(Q), or in an appropriate sense, and allows us to de- 

fine converging discretization methods that provide external approximations to 
the solutions of variational problems involving the Hessian. This generalizes ideas 
discussed by Davini [10, 11] and Davini and Pitacco [12, 13]. 

It is routine to show that: 




GENERALIZED HESSIAN AND EXTERNAL APPROXIMATIONS 



227 



LEMMA 3. There exist constants c and C, which do not depend on h, such that 
c\\c h v\\ 2 L2 ^ < ||l|| 2 l2(Q) < C\\c h v\\ 2 ma) VO e X h . 



Proof. Let v e X /t and Tj be one of the primal triangles. Then, 

v(x) = X v(x tl )(p t i (jc) for x e Tj, 
tieJfhU) 



where the sum extends to the nodes that are vertices of Tj and X h (j) c X), denotes 
the set of values taken by their respective indices. Hence, we have 




X )(/ 

tutieXhU) XJTj 



<A, <Pt 2 d-r )v(x, 2 ). 



So, the matrix (f T <p ti f, 2 dx) is strictly positive. One can use an affine change of 
coordinates x — > x’ to transform Tj into a normalized triangle, obtaining that 

<Pti dx^ = | Tj | K h , 2 , t x ,t 2 & X h (j), 

with K nh the entries of a 3 x 3 symmetric matrix which is also strictly positive and 
independent of j. It follows that there arc positive constants A and A such that 

M T jl X v(x tl ) 2 ^ f \v\ 2 dx ^ A\Tj\ X v(x tl ) 2 . 
tieJfhU) jTj 




By summing up over the primal triangles and reorganizing the sums in the first and 
last term suitably, it follows that 

AX^(x/) 2 X l r /l ^ / l^| 2 d.x ^ A X v(xi) 2 X \ T il 

ieX h jetPiAi) jQh ieJth jeS^i) 



where tpf,(i) C •V’/, is the subset of index values relative to the primal triangles 
that have x,- as a vertex. By observing that j & s> h (,•) I Tj \ = |supp | and recalling 
Assumption (HO), we find that 

3aX«(x/) 2 I^K f |0| 2 dx < 3A X u(xc,-) 2 |Z} | 

ieJt h ieJt h 



which is our thesis. 



□ 



LEMMA 4. There exists a constant c, independent ofh, such that 
l|Vf|| L 2 (Q) ^ c||H/,i)|| L 2 ( j 2) Vi) e X Qh . 
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Proof. This is a straightforward consequence of equation (10) and Lemma 3. In 
fact, by applying the Cauchy-Schwarz inequality to equation (10), with i jr equal 
to v, and the Poincare inequality successively, we deduce that 



/ 

J £lh 



V v (8) V v dr 






Hhv\\ L 2 { a)\\ c hV\\LHsi) < c||Hfcu|| L 2 (n) IIV0|| L 2 (O) 



for some constant c. But 

|2 

!l 2 (Q) 



l|VC||7 



= / • f Vv 

J £2;, 

j 

J Q 



V v dx 



€ V2 



V v 0 V v dx 






C | H/, ^ II L 2 (Q) II V f II /,2 (n) . 



(15) 



Hence, the lemma is proved. □ 

We now prove a compactness theorem. 

THEOREM 1. Let v h e X 0h and v e L 2 (Ll). If 
Vh — »■ v in L 2 (£l) 



and 

supIH/T/i || L 2 (Q) < +oo, 
h 

then 

v e h 2 (Q) n H^(Q), 

{)/, — » v in H l (Q), 



and 



H hVi, — ^ V 2 v in L 2 (Q). 



Proof Since sup /( HH/tIIlTq) < +oo, from Lemma 4 we deduce that 

sup||Vf,,|| i 2 (Q) < Too, 
h 

and hence 0;, — ^ v in H 1 (Q). But 0/, e H ( \ (O) and hence also v € H { \ (£2). Let 
/ e C~(£2). Then, for h small enough supp / <s Ll h and we have that 

[ V dh ®Xr h f dx = 

JQ 



= L V v h <S) V<Pj dx ) f(xj) 

j&H 2 ’ 

= - 'Yh I T i I H/1 Vh (x j ) f (• Xj ) = - / H h Vh C h f dx 

jeH 

= - HhVhfdx- / H h Vh(c h f - f)dx 
Jo. Jo. 
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and hence 

f Vu,,<g>V/ dx = - f H h v h fdx- ( mvhichf - f)dx 
J £2 J J £2 



/ 

Jn 



+ / Vv h ®V(f -r h f)dx. 



(16) 



Let s > 0. By using the fact that Chf — ► / in L 2 (Q) and r h f — > f in H 1 (Q), 
we find that for h sufficiently small we have 



/ H 



hVhichf - f) dx 



+ 



/ 

Jn 



Vv h ® V(f-r h f)dx 






£ II/IIl 2 (q>- 



From equation (16) we find 



/ 

Jn 



Vu A ® V/djr 






c II/IIl 2 (Q) 



and passing to the limit we deduce 



I( d2 »./)I= f 

Jn 



— I / V v ® V / dr 



^ c||/Hl 2 (G)- 



Thus the Hessian is a bounded I i near operator on L 2 (Q). By the Riesz representa- 
tion theorem there exists a function V e (L 2 (i2)) 2x2 such that 



[D 2 v, 



/)= f Vf dx. 
Jn 



As is usually done we shall denote the function V by V 2 u. Now, passing to the 
limit in equation (16) we deduce 



hVhfdx, 



/ V 2 u/d.r — — Vv <g) Vfdx — lim / H, 

Jn Jn h Jn 

for every / e C^°(Q), which implies that 
U h Vh - V 2 r in L 2 (t2). 

We now finish the proof by showing that Vi)/, converges strongly in L 2 (Q) to 
Vv. To this end, from equation (8) we find 



/ 

Jn 



Vv/, ® V v/, dx = 



— - I H hVh vt, dx + / H, 
J £2 J £2 



h VhiVh - C h v h )dx. 



Hence, 
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as H hVh y 2 v, Vi, v and (v/, — c/,v/,) — »■ 0 in L 2 (D). By calculadng the trace 

of the terms on the two sides of (17) it follows that 

lim || VO/, ||^ 2(n) = II V*t||^ 2(£2) . 

Therefore, 

Vv h — >• Vt> in L 2 (D) 

since Vv h — ^ Vv in L 2 (D). □ 

Assuming Property (★) we now deduce a density result. 

THEOREM 2. Assume that Property (★) holds. For every v e H 2 (Q) n H { \ (£2) 
there exists a sequence {£>/,} C Xq /, such that 

Vh — > v in H l (D), 

and 

H i,Vh V 2 v inL 2 (D). 



Proof. We stait by considering v € C°°(D) D H 2 (D) n H { \ ( D ) and let 0/, := r h v 
be its nodal interpolation defined in equation (7). Obviously, v h — > v in H 1 (£2). 
We now compute 



H hV„(xj) = J-[ V5*®V^dx 

\ 1 j I Jn h 






V&<8> V<pj d.r I v(xk) 



at any internal node xj. Since the function v is smooth we can write 

V(x k ) = v(Xj) + Vv(Xj) ■ (X k - Xj) 

+ ^V 2 v(xj)(x k - Xj) ■ (x k - Xj) + o(|x* - Xj I 2 ). 

Then, taking into account that 

N h 



/ £ 

v Q 1=1 



yfk ® Vi pj dx 1 v(xj) = 0, j e dy, 



(18) 



(19) 
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because Xl/t=i 9k — 1- and 

r Nh 

/ ^ Vf (*/) ' ( Xk - x i^V >k ® dx 

k= 1 

= / v |vd(x ; ) • y>, - x;)^ <g> V*, dx 

= / V(Vu(x 7 ) • (x — x/)) ® dx 

= Vu(xy) ® / V<£y dx = 0 j G $h, 

Jn 



( 20 ) 



we deduce 



H/,u/ ; (x 7 ) = 






® Vi -pj dx 1 V“u(x y -)(xt - x j ) • (x* - x/) 



N h 

+ (/; 2 ) 
*=i 

— (n.0) vv2 



( 21 ) 



= G ; y V v(xj) + 0(1) 7 e d 5 ,,, 

where we have taken into account that x/. and x ; - must belong to the same triangle in 
order to appeal - in the expressions above. In equation (21) o(l) indicates a quantity 
that tends to 0 with h, uniformly in j . Hence, by applying Property (★), 

lim sup | H/, Vh ||i 2 (S1) = lim sup J2\ H hV h (xj)\ 2 \fj\ 



j&H 



< 



lim sup ^ 



iO')|| 2| v7 2 



v 2 u(x;)| |r,-| 



It follows that 

lim sup | H/, Vh 

h 



je$h 

^ lim sup | sup H ^0) " 2 

h \ je$ h 

N h 

^ lim sup V'|V 2 u(x / )|"|P/|. 



2 <r xi 2 2 

L 2 (Q) ^ V V L 2 (Q) - 



/, II £| V V*;)| 2 |7) 



( 22 ) 



For a subsequence, not relabeled, we have that sup /; ||H/,f/, || 2 2(Q) < +oo, and hence 
from Theorem 1 we deduce that 

Uhk - V 2 r in L 2 (£2), 

while, from equation (22) we find that the convergence is indeed strong in L 2 (£2). 
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We now consider the general case, i.e., v e H 2 (Q)D //,[ (S2) . Let e C°°(£2)n 
H 2 (Q) fl H { '(Q,), k e N, be such that — »■ v in H 2 (Q). From the case discussed 
above we deduce that for every k there exists a h = h(k ) such that if we let v h := 
r h(k) w k we have 



II U>k lltfl(Q) ^ 
fH/,0/, - V u;jfc|| L2(n) ^ -- 

Now, letting k go to infinity we see that v h is the sequence we were looking for. □ 

The following important valiants of the previous theorems apply if we take into 
account the generalized Hessian up to the boundary nodes. 

THEOREM 3. Let v h e X 0h and v e L 2 (Q). If 
Vh — > v in L 2 {Q) 



and 



supHH/,0/, ||z, 2 (Q) < +oo, 
h 



then 



v e H 2 (Q), 
Vh — > v in 



and 



H h v h — y 2 v in L 2 (Q). 



Proof. Since sup /l ||H, I i;, ) || L 2 (Q) < +oo implies that sup,, ||H/,fi/, || L 2 (Q) < +oo, 

o 

the conclusions of Theorem 1 hold. Moreover, obviously, H/, Vh — ^ V 2 t> in L 2 (I2) 
implies that H/,0/, — ^ V 2 i; in L 2 (Q). Thence we have only to prove that 

v n =0 in 312. 

To see this, we can replicate the argument of Davini [11, Lemma 3], Let / e 
C°°(£2) and observe by the Green formula that 

f Vj,fds= I A vf dx + f Vu-V/ dx. 

J d t/ «/ 



(23) 
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Proceeding as in Theorem 1, we have that 

f VvVfdx = f Vv h ■ Vr h fdx + f Vv h • V(/ - r h f)dx 
J £2 J £2 J £2 

+ f V(t> — V/,) ■ V f dx = f Vv h -Vr h f d.r+o(l), 

Jn Jn 

(24) 

where we have used the fact that r;,/ — > f and 0/, — > v i n H 1 (O). By taking 
account of equation (24) in equation (23) and observing that 

r N h 

/ Vv h ■ Vr h f dx — Y"' trace (H/, 0,, (x/))/(x/)| 7) | , 

Jq 7=1 

cf. equation (8), it follows that 



r r ^ 

/ v„f d.v = / Ai;/dA- + ^ tracc(H,,{)/ ; (A' / ))/(.r / )|T / | + o(l). 
J d £2 J £2 j i 

Hence, by using obvious inequalities and passing to the limit we get 



L 



v,„fds 



/3Q 

Then, by density, 






^SUpHH/,0/, || i 2 (n) + || An 



/ v, n fds ^ (sup||H /i 0,,|| i 2 (Q) + ||Ai7|| i 2 (Q) )||/|| L 2 (Q) 

Jan v h 7 



(25) 



for all / e This implies that v„ = 0 on because, for every / e 

H 1 (Q), it is always possible to construct a sequence {/*} such that / — f k e 
//q 1 (0) and f k 0 in L 2 (I2). Thus, v G // 0 2 (C2). □ 



THEOREM 4. Assume that PROPERTY (★) holds. For every v G H ( 2 (Q) there 
exists a sequence {£/,} C A//, swc/i that 

i — > i; m //'(Q), 



and 



H/,1)/, — »• V 2 n inL 2 (Q). 

Proof. We start considering i> e C ( /(0) and proceed as in Theorem 2. Let 
€’/, := r/jV. Obviously, 0;, — > v in H l (Q) and the analysis that leads to equation (21) 
keeps on holding for the internal nodes. On the other hand, for h small enough 
v(xj) = Vv(Xj) = 0 at the boundary nodes. Thus, equations (19) and (20) hold 
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true for every node. It follows that equation (21) applies to every node as well and 
thence we have that 

lim sup || II v2 HIlW (26) 

h 

One of the implications of Theorem 1 is that H h v h — V 2 n in L 2 (Q.), as we have 
already noticed. Thus, we conclude from (26) that the convergence is indeed strong. 
By repeating the argument of Theorem 2, the thesis follows from the density of 
C ( f(Q) in H 2 (Q). □ 



5. External Approximations of Quadratic Functionals: the Equilibrium 
Problem for Anisotropic Elastic Plates 

We use the results of Sections 3 and 4 to prove the convergence of a direct non- 
conforming approximation scheme for quadratic variational problems involving 
the Hessian. Namely, we adopt the format of T -convergence theory in order to 
prove that a suitable sequence of discrete functionals {5),} defined in the spaces 
X 0 i, T -converges in an appropriate topology to the functional !F that describes the 
problem we wish to approximate. In particular, since the limit functional we shall 
consider has a unique minimizer and the discrete functionals arc equicoercive, one 
of the key properties of T-convergence stated in Theorems 7.8 and 7.24 of [9] 
applies: 

min T (v) = lim min T), ( (’) (27) 

h 

and 

u h — > u, (28) 

u and u h being the minimizers of !F and T/,, respectively. Therefore, the u /, can 
be used in order to approximate u. To compute them a sequence of discrete un- 
constrained minimum problems for the functions T), have to be solved and we can 
use standard techniques to do it. It is fair to say that the customary perspective of 
T -convergence is reversed here, since the theory is used for validating approxima- 
tion schemes rather than for getting a characterization of the limit problem, as is 
more common in other types of applications. 

Let us consider the equilibrium problem for an elastic homogeneous anisotropic 
plate under transverse loads 

Ca/3y8U,y8o(P — f *~>n (Z K. , 

B q u = 0 on 312, (29) 

Bin = 0 on 312, 

where / describes the applied loads, C a p y s are the components of the elasticity 
tensor C of the plate and B 0 and B\ arc suitable boundary operators. We assume 
that C has both the minor and major symmetries 
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and is strictly positive, that is, 

CK • K ^ m|K| 2 VK e Sym (31) 

for some positive real number m > 0. Sym stands for the space of two by two 
symmetric matrices. For our puiposes below we also assume that / e H~ l (Q). 

As to the boundary conditions, we focus on two cases: 

u — u„ = 0 on 9 £2 (clamping conditions) (32) 

and 



u = C a pysU'y8n a np = 0 on 9 £2 (simple support conditions). (33) 



To apply the techniques of T -convergence it is convenient to study the varia- 
tional formulation of problem (29). Let us define 

¥ (v) [ CV 2 u • V 2 v dx — 2(/, v), 

having denoted by (-, •) the duality pairing between // _1 (£2) and H { \ (£2). Then, the 
solution of problem (29) can be found by minimizing IF among all v € //J(£2), 
with Hp (Q) = H 2 (Q) H H { \ ( F2) or HjAQ. ) = ) in the two cases above, 

respectively. We shall extend in fact !F to L 2 (Q) by letting it take the value +oo in 
L 2 (I2) \ //|(f 2), that is, we set 



F(v) 



\ CV 2 n • V 2 ud.r -2 (f,v) 

Jo. 

Too 



if v G H 2 (Q), 

if v € L 2 (C2) \ H 2 (Q). 



(34) 



Accordingly, we introduce the sequences of discrete functionals 

mv) := 



J CH /,11 • H/jiidx — 2(f, v) ifveX 0h , 

Too V € L-(Q) \ Xq 



(35) 



in the clamping case, and 

Fh(v) := 



J ch h v ■ k h v dx - 2(f, v) tfveXoh, 

Too v € 2) \ X 0h , 



(36) 



in the simple support case. 

The next theorem follows from the theory discussed so far. 



THEOREM 5. Let C be a strictly positive symmetric fourth-order tensor and let 
f e 02). Assume that Property (★) holds. Let T be defined as in (34) and 
IF/, as in (35) or (36) in the clamping and simple support case, respectively. Then, 
Th T -converges to the functional T with respect to the lf(Q) topology. 




236 



C. DAVINI AND R. PARONI 



Proof. The two cases can be alike treated, so let us consider the clamping 
case for illustration. Since L 2 {Q.) is a metric space we can use the sequential 
characterization of T -convergence: 

‘T(L 2 ) — lim/, !Fk = F if and only if the following conditions are satisfied: 

(i) Vt> e L 2 (£I) and V{t>/,} c L 2 (£2), iy, — >• v : IF (v) ^ liminf Fh(Vh), 

L 2 (Q) h 

(ii) Vr> g L 2 (i2), 3{u/,} c L 2 (I2), iy, — >• v: F (v) ^ limsup ^(ly,).” 

L 2 (Q) h 

We refer to the book of Dal Maso [9] for more details. Let us call the two require- 
ments the lim-inf inequality and the recovery sequence condition, respectively. 

We start by proving the lim-inf inequality. Let ly, be a sequence in L 2 (Q.) con- 
verging to v in the L? norm. If liminf/, F/fvi,) = +oo there is nothing to prove. 
Hence, suppose liminf/, F,(vh ) < +oo. By passing to a subsequence, if necessary, 
we have that supJT/, (?y,)| < +oo and hence 0/, := ly, e X 0h . Moreover, using 
simple inequalities and Lemma 4 we find 

F h (v h ) > m||H/,0 / ,|| 2 2(Q) - C ||Vt)„|| i 2 (Q) 

> cil|V0/,|| 2 2(Q) -c||V0/,|| L 2 (Q) (37) 

with ci a positive constant. From these inequalities we deduce that sup ; , || Vi)/, ||/ 2 (Q) 
< Too and that sup /( 1 1 H /, /, || L 2 (Q) < Too. Then, by Theorem 3, we obtain that 
v € H 2 (tt), 

v>h — > v in H 1 (Q), and H/,0/, — ^ V 2 n inL 2 (I2). 



Since 



0 T C (H/, v h - V 2 u) • (H/, 1 ) h - V 2 v) 

= CH h v h ■ H/,0/, - 2CH/, 0/, • V 2 n T CV 2 u • V 2 n, 

it follows that 



CH/,0/, • H/,0 ; , T 2CH/, 0/, • V 2 n - CV 2 u • V 2 n, 

and using this inequality we find 

liminf ^7,(0/,) = liminf / CH/,0/, • H/,0/, dx — 21im(/, 0/,) 
h h Jn h 

^ / CV 2 u • V 2 rd.r - 2(f, v) = F(v). 

Jn 

The existence of a recovery sequence is an immediate consequence of Theo- 
rem 4, and thus we have finished the proof. □ 
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REMARK 3. It is possible to extend the analysis and allow the datum / to be 
in (i/J(£2))*, the dual of //J(£ 2), rather than just in Indeed, by an easy 

application of the Lax-Milgram lemma, for every / g (HjXAl))* there exist a 
/° e L 2 (S2), f l G L 2 (^; JR 2 ) and a f 2 G L 2 (G; M 2x2 ) such that, 

(/, V) = f (A + Z 1 • Vu + / 2 • V 2 v) dx 

JQ. 

for every v G //|(f 2). Here, {•, •} denotes the duality pairing between Hj-CAl) and 
its dual (//J(£ 2))*. It is then natural to define 

((/, v»a := f (f°v + Z 1 • VC + / 2 • H/,0) dx 
Jn 

for every v G Xo/,. Note that if Vf, — ^ v in H ] (S2) and H/,0;, — ^ V 2 v in L 2 (Q) then 

((Z v h ))h (Z. v >- 

Thence for / G ( HpdQ.))* the previous theorem still holds provided we replace 
(Z, n) with ((/, u})/, in the definition of T/,, i.e., in equations (35) and (36). 



6, External Approximations of General Convex Functionals 

The results of Section 5 extend to more general functionals. Giving symbols the 
same meaning they had in the previous sections, here we consider functionals of 
the form 

F(v) := [ W(x, v , Vv, V 2 v) dx - 2 (/, v) (38) 

Ji 2 

to be minimized in T/|(f2). We assume that W: S2 x M x M 2 x M 2x2 -a- [0, +oo) 
be a function satisfying the following requirements: 

(HI) W is a Caratheodory function; 

(H2) W(x, s, •) is convex for a.e. (x, s,£)g£2xMx R 2 ; 

(H3) there exist two positive constants c, C such that 

c|A| 2 ^ W(x, s,(, A)^C(l + | A | 2 ) 

for a.e. (x, j,5)gS2xRx M 2 . 

To stay with a mechanical interpretation, this class of problems encompasses 
the equilibrium of nonhomogeneous and non linearly elastic plates, including the 
case where the plate is supported by a Winkler foundation, for instance. Since the 
continuity of W on x is not required, the theory is applicable to materials with 
inclusions of different materials. 

The goal of this section is to study the approximation of these kind of func- 
tionals in spaces X 0h by using the generalized notion of the Hessian introduced 
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above. As before, we imagine F extended to L 2 (Ll) by defining it equal to +oo in 
L 2 (Ll) \ Hj_ ( <2 ) and introduce the sequences of discrete functionals 



Fhiv) := 



N h 

J2 W ( X J’ V h v( Xj ), H h v(xj))\fj\ - 2 {f, v) if v e Xoh, 
7 = 1 

+oo if v e L 2 (L1) \ X 0h 

(39) 



in the clamping case, or 



Fh{v) := 



N h 

Y2w(x h v{xj), Vi,v(xj), n h v(xj))\Tj\ - 2 {f, v) if v e X Qh , 
7=1 

+oo if v e L 2 (L2 ) \ X 0 i, 

(40) 



in the simple support case. In formulae (39) and (40) we have set 

1 f 

V/ ; i j(xj) := / VudA. 

I T j\ J Tj 

THEOREM 6. Assume that (HI )-(H3 ) and Property (★) hold. Let f e H~ l (£2), 
and let !F and IF/, be the functionals defined in (38) and (39), or (40), according 
to the studied case. Then Fh T -converges to the functional F, with respect to the 
L 2 (Ll) topology. 



To prove the theorem above we shall use the following well known theorem: 

LEMMA 5 (see [8]). Let LI be a bounded open set of ML. Let g: L2 x M m x R' v — > 
[0, Too] be a Caratheodory function. Let 

$(“.£) : = / g{x,u(x),%(x))dx. 

Jn 

Assume that g(x,u, •) is convex and that u^ — >■ u in L 2 (Q) and that ^ —*■ § in 
L 2 (LI) then 

liminf$(K*,&) R $(w,£). 

k 

Proof of Theorem 6. Again the proof is similar for the two kinds of boundary 
conditions, so let us consider the clamping case. By using equation (7) and defining 

Nh N h 

Chid(x) := ^ XjXfjix ), V,,t>(x) := ^V,,i;(x ; )xf.(x) 
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we can write 



Fhiv) := 



/ W(c h id, c h v , V/,n, H/,u) dv - 2(/, i>) 
5 n 
+oo 



if v e X 0 /,, 
otherwise, 



where id: M 2 — > M 2 is the identity function. 

We first prove the lim-inf inequality. Let 17 , be a sequence in L 2 (Q.) converging 
to v in the L 2 norm. If liminf/, 50, ( 17 , ) = +00 there is nothing to prove. Hence, 
suppose liminf;, 57 ,( 17 ,) < + 00 . By using H3 and proceeding as in Theorem 5 we 
deduce that 0/, := 17 , e X 0h , v € H^(Q.), 

Vh — > v in H 1 (Q), and H/,0/, — ^ V 2 t> inL 2 (I2). 



By the Scorza-Dragoni theorem (cf. [14]) we have that for every e > 0 there exists 
a compact subset K E of Q such that | (2 — K, \ < e and W restricted to K, x 1ft x 
Ift 2 x M 2x2 is continuous. Now, observing that c /, id —+ id in L°°(£2), c/,0/, — v in 
L 2 (I2), and V/,0/, — > Vv in L 2 (f 2) we have that u h := (ci,id, ci,Vh, V/, {)*) — > u := 
(id, v, Vv) in L 2 (Q.), hence by applying Lemma 5 we find 

liminf 57 ,( 1 )/,) > liminf f / k s (x ) W (u /, . H/, 17 , ) dx — 2 I i m ( /, £ 7 , ) 
h h Jn h 

> f XK e (x)W(u,V 2 v)dx -2{f,v). 

Jn 

Letting s go to zero and applying Fatou’s lemma we obtain 



liminf 50,(0/,) > 

h 



/ 

Jn 

/ 

Jn 



W(u, V 2 v)dx -2 {f, v) 

W( x, v , Vv, V 2 v)dx - 2 (/, v) = 5 r (u). 



We now prove the existence of a recovery sequence. If v does not belong to 
Hq(Q) there is nothing to prove. So let us suppose that v e Hfi(Q). By Theorem 4 
there exists a sequence 0/, e X 0 /, such that 



0/, — »■ v in // 1 (X2) , and H/,0;, — »■ V 2 v in L 2 (Q). 



It immediately follows that c/,0/, — »■ v in L 2 (f 2), and V/,0/, — > Vv in L 2 (Q). 
Moreover by passing to a subsequence, if necessary, we may also suppose that 
c/,0/,, V/,0/, and H/,0/, converge almost everywhere to v, Vv and V 2 v, respectively. 
Since 



lim (/, 0 /,) = (/, v) 

h 



it suffices to prove that 



lim sup / W(c/, id, c h 0/, , V/, £ 7 , , H/, 0/, ) dx ^ / W(x, v, Vv, V z v) dx. (41) 
h J Q. J Q, 



/ 

Jn 




240 



C. DAVINI AND R. PARONI 



Observing that the integrand below is non negative, by using (H3) and Fatou’s 
le mm a we have 

liminf [ (C(l + |H,,{' /( | 2 ) - W(c h id, c h v h , V h v h , U h v h )) dx 

'• Jo, 

> [ liminf(C(l + |H,,i) /; | 2 ) - W(c h id, c h v h , V h v h ,H h v h )) dx 

Jk s h 

= [ (C(l + |V 2 u| 2 ) - W(x, v, Vr>, V 2 u))dx, 

JK e 

where K e is the compact subset of O defined in the first paid of the proof. Letting 
s go to zero and applying Fatou’s lemma we find 

liminf f (C(l + |H/, ?)/, | 2 ) — W(ci,id, ci,vi ,, V/,fi/,, H/,0/,)) dx 
h Jo. 

^ / (C(l + |V 2 u| 2 ) - W(x, v, Vu, V 2 u))dx. (42) 

Jo. 

On the other hand we also have 

liminf [ (C(l + |H /l i), ) | 2 ) - W(c h id, c h v h , V h v h , H h v h )) dx 
h Ja 

^ limsup f C(l + | H/, 0/, | 2 ) dx 

h JQ. 

+ lim inf / —W (c h id, c h v h , V/, v h , H, ; v h ) dx 
h Jn 

= f C( 1 + | V 2 i;| 2 ) dx — lim sup f W (q, id, Ch v /, , V/, v h , H/, 0/, ) dx . (43) 
J £1 h J Q. 

Combining equations (42) and (43) we obtain equation (41), and thus we have 
completed the proof. □ 



7, Convergence of the Minimizers 

As pointed out at the beginning of Section 5 the functionals we have considered 
have properties that assure the convergence of the minimizers of to the minimiz- 
ers of the functional !F in the L 2 topology, by general properties of T -convergence. 
Flere, however, that convergence is stronger. The considerations done in this short 
section apply to the clamping case as well as to the simple support case. For 
simplicity we discuss only the former. 

Let Cl/, be a minimizer of the discrete functional !F/, defined by either one of 
equations (35) and (39). Then, due to the equicoercivity of the functional T/,, to 
Lemma 4 and to Poincare’s inequality we have that the sequence «/, is bounded in 
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and thence it has a weakly convergent subsequence (not relabeled) con- 
verging to a function u e H l (Q) which is a minimizer of F by equation (27). 
Indeed from Theorem 1 we deduce that 

Uh — ^ u, in H l (£l) and H h u h V 2 u inL 2 (£2). (44) 

The next theorem shows that also the generalized Hessian of u h converges strongly 
in L 2 (Q.) to the Hessian of u. By equation (27) this is obviously true for the 
quadratic case, but a similar result also holds for the minimizers of the functional 
defined by equation (39) provided the potential W is strictly convex in the last 
variable. 

THEOREM 7. Assume the hypothesis of Theorem 6 hold. Moreover, suppose there 

i ~- / 2 2x2 2x2 

exists a continuous function VT:t2xRxM‘'xR“‘'— “ and a strictly positive 
constant y > 0 such that 

W(x,s,%, A 2 ) ^ W(jc, s, £, Ai) + W(x, s, £, Aj) • (A 2 — Af) 

+ y\A 2 -A l \ 2 , (45) 

fora.e. (jc, s, £, Ai, A 2 ) e x M x l 2 x R 2x2 x R 2x2 . Then if u h is the minimizer 
of the functional F, defined in equation (39) we have that 

Uh — > u, in H l (Q) and H /, ft /, — > V 2 n in L 2 (Q), 

where u is the unique minimizer of the functional IF defined in equation (38). 

The proof of this theorem is standard, we include it here just for the reader’s 
convenience. 

Proof. Because of (44) we just have to prove that — ► V 2 n in A 2 (ST2 ) . From 

assumption (45) it follows that 

Fh(u h ) ^ [ (W(c h id, u h , V/,n/,, V 2 n) + y\H. h u h — V 2 n| 2 

Jo. 

+ W(chid, Uh, V/,n/ ; , V 2 «) • (H — V 2 «)) d.r — 2 (/, uf). 

From the convexity of W in the last vaiiable and hypothesis (H3), we deduce 
(cf. [8]), that | W(x, s, A) | ^ C(1 + |A|) for a.e. (x, s, ?) e 2 x 1 x M 2 , and 
from Lebesgue’s convergence theorem we find that W(ci,id , if;,, V/,«/, , V 2 «) — > 
W(x, u, Vw, V 2 n) in L 2 (Q). Hence passing to the limit in the inequality above 
and using the fact that Ffui,) — ► F (u) we deduce that 

F (u) f F (u) + y lim sup / |H h u h — V 2 m| 2 dx 

h JO. 



which concludes the proof. 



□ 
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Abstract. We study infinitesimal deformations of a porous linear elastic body saturated with an 
inviscid fluid and subjected to conservative surface tractions. The gradient of the mass density of the 
solid phase is also taken as an independent kinematic variable and the corresponding higher-order 
stresses are considered. Balance laws and constitutive relations for finite deformations are reduced 
to those for infinitesimal deformations, and expressions for partial surface tractions acting on the 
solid and the fluid phases are derived. A boundary-value problem for a long hollow porous solid 
cylinder filled with an ideal fluid is solved, and the stability of the stressed reference configuration 
with respect to variations in the values of the coefficient coupling deformations of the two phases is 
investigated. An example of the problem studied is a cylindrical cavity leached out in salt formations 
for storing hydrocarbons. 

Mathematics Subject Classifications (2000): 74F10, 74F20. 

Key words: solid-fluid mixture, conservative tractions, principle of virtual power, partial tractions, 
fluid-filled cylindrical cavity, stability analysis. 
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1. Introduction 

Simple models of a mechanical system comprised of a deformable porous solid 
matrix filled with a compressible fluid have been developed by Fillunger [22], 
Biot [11], Truesdell [40] and Muller [30]. In these works a spatial point is simul- 
taneously occupied by all constituents. This is readily comprehensible for gaseous 
mixtures [30] and fluid solutions [41], 

Observations on fluid saturated solids have shown higher values of fluid per- 
colation through pores of the solid matrix than that predicted by the aforestated 
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models (see, e.g., [13] or [33]). The increase of percolation is possibly not only due 
to the higher externally applied pressure but also due to the opening of pores in the 
vicinity of the boundary (see [17]). This provides a justification for endowing the 
theory of mixtures with the volume fraction concept (see, e.g., [18]); this additional 
scalar parameter can describe the nonstandard dilatation effects (see, e.g., [38]) as 
follows. Once the mixture is modelled from a microscopic point of view and the 
reference configuration of the solid matrix is required to be periodic, then those 
dilatations of pores which do not involve global deformations of cells arc captured 
by the volume fraction. 

In this paper we consider a binary mixture involving a second gradient solid 
and a perfect fluid; we refer the reader to [23, 24, 28] for the relationship between 
micro-structural and second gradient theories. Our approach is close to that of the 
volume fraction concept as the latter reduces to the former once a suitable con- 
straint among the enlarged set of state parameters is assumed [34]. For example, 
for an incompressible solid constituent, it is easy to see [25] how a mixture model 
endowed with the volume fraction concept transforms into a binary mixture whose 
solid constituent has second gradient constitutive relations. After formulating a 
general problem, we study deformations of a porous hollow linear elastic cylinder 
filled with a perfect fluid. In particular, by assuming that the internal energy density 
can be split into a paid involving first gradient of the displacements and a paid 
involving second order gradients of displacements, we perform parametric analy- 
sis of the density profiles of the solid matrix with respect to a suitable energetic 
coupling coefficient between the solid and the fluid. We limit our analysis to the 
case when the external tractions applied on both constituents arc conservative and 
can be derived from a potential. 

We also discuss stability of the prestressed reference configuration of the hollow 
cylinder with respect to perturbations of the aforementioned energy coupling coef- 
ficient; an energetic criterion is proposed for this analysis. The distance in the space 
of mixture configurations is described in terms of the total energy which equals the 
sum of the mixture deformation energy and the potential of surface tractions. 

Batra et al. [5-9] have analyzed numerically finite transient thermomechanical 
deformations of a homogeneous body with the Cauchy stress and higher-order 
stresses depending upon gradients of deformation. 



2. Formulation of the Problem 



Material particles of the fluid and the solid are identified respectively by their posi- 
tion vectors X (f) and X (s) in fixed reference configurations ST,', and ST, S ,. We presume 
that, at any time t, particles of both constituents occupy the same position x in the 
present configuration ST. The velocity \ (a) (a = f, s) of the material particle X'" 1 
is defined by 



v 



a 



d ( “ ) u (a ' ) (X (0,) , t) 



d t 



( 1 ) 
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where d (a) /dt denotes the material time derivative following the motion of X“ and 
u (a) is the displacement of the ath constituent from its reference configuration. Let 
p <f> and p <s> denote, respectively, the apparent mass densities of the fluid and the 
solid; then the mass density p of the mixture equals p <r> + p <s> . The mean or the 
barycentric velocity v of the mixture is defined by 

py = p (f) v (f) + p (s) v (s) . (2) 

Details of the theory of mixtures arc given in [13, 19, 27, 29-31, 33, 40]. 



2.1. BALANCE LAWS 



We presume that there is no interconversion of mass between the solid and the 
fluid. In the spatial description of motion, the balance of mass for each constituent 
is given by 



— - + p ia) div v (a) = 0, (3) 

dr 

where div v = tr(gradv), and grad denotes derivatives with respect to coordinates 
in the present configuration. We use the principle of virtual power to derive the 
balance of linear momentum and the boundary conditions for each constituent. 
That is, we postulate that 



f ( m W . y(s) + m (f) . y(f) + X (S) . V y(s) _ p (f) div + n (s > - VVv (s) ) dV 

J Q 

= f (b (s) • v (s) + b (f) • v (f) ) dV 
Jo. 

+ /( 



t (s > • V (s) + t (f) • V® + T (S) 



3v (s) \ 

- dA. 



dn / 



Here m (c,) is the bulk solid-fluid interaction force, T (s) the partial Cauchy stress in 
the solid, p <f) the hydrostatic pressure in the fluid (we assume that the fluid is ideal; 
therefore partial Cauchy stress in it is spherical), II (s) the second-order stress in 
the solid, V the gradient operator with respect to coordinates in the present con- 
figuration, b" J> the density of partial body forces, t ta) the partial surface tractions, 
r (s) the traction corresponding to the second-order stress tensor in the solid, v (s) 
the virtual velocity in the solid that vanishes on the paid of the boundary of the 
solid where essential boundary conditions* arc prescribed, v f the virtual velocity 
in the fluid that vanishes on the paid of the boundary of the fluid where essential 
boundary conditions arc specified, a • b the inner product between tensors a and 
b of the same order, and 9v (s) /3 n is the directional derivative of v (s) along the 
outward unit normal n to the boundary 9C of C. The effect of inertia forces is 



* For a boundary- value problem with field equations involving derivatives of order 2m, boundary 
conditions involving derivatives of order at most (m— 1) are called essential; others are called natural. 
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included in the density of body forces. The physical meaning of II (s) and r fs) can be 
described in a way similar to that done in different contexts in [20, 23]. We note that 
the external action r (s) can be regarded as the sum of two different contributions, 
the first one is a doubly normal double force, i.e., an external areal action which 
works on the rate of opening, (Vv (s) • n ® n), along the outward unit normal n of 
pores on the boundary, the other one is a tangential couple working on the vorticity 
of the apparent velocity of the solid; this nomenclature is due to Germain [23].* 
The areal action is also considered in the Cosserat model for granular materials 
(see, e.g., [21]) and in the present problem vanishes. However, the doubly normal 
double force plays an important role in the dilatancy phenomenon studied here. 
A motivation for considering higher-order stresses in the solid will be provided 
below. We note that capillary type forces in the fluid, discussed in the literature by 
second-order stresses (see, e.g., [14-16, 26]), have been neglected here to keep the 
analysis tractable. Whereas we have included in equation (4) the internal supplies 
m (f) and m fs) of the I i near momentum, the internal supplies of the moment of 
momentum have been neglected. This is consistent with the assumption that the 
stress in the fluid is a hydrostatic pressure. The symmetry of T (s) follows from 
equation (4) by setting 

y(b _ y(s) _ velocity field of a rigid body motion, 

that is, from the objectivity of the left-hand side of (4) which also implies that the 
sum of m (f) and m s) equals zero. 

By using the divergence theorem and exploiting the fact that equation (4) must 
hold for all virtual velocities vanishing on the paid of the boundary where essential 
boundary conditions are given, we obtain the following set of field equations and 
boundary conditions: 



div(T (s) - div n (s) ) - m (s) + b (s) = 0, 


in £2, 


(5) 
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< 

"a 

3 

1 

3 
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+ 

cr 
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II 

© 


in £2, 


(6) 


m (s) + m (f) = 0, 


in £2, 


(V) 


(T (s) - div n (s, )n - div s (II (s) n) = t (s) , 


on 9i £2 , 


(8) 


(n (s) n)n = r (s \ 


on 9i £2 , 


(9) 


v (s) _ ~(S)^ 


on 9 2 £2, 


(10) 


1 

“a 

3 

S 

II 

s 


on 9 3 £2, 


(11) 


v (t) • n = 0® , 


on 9 4 £2. 


(12) 



Here div s is the surface divergence on 9 £2, and 9] £2 and 9 2 £2 are complementary 
parts of the boundary 9 £2 of £2, where natural and essential boundary conditions, 
respectively, are prescribed for the solid; a similar interpretation holds for 9 3 £2 and 
9 4 £2 for the fluid. 

* Here a ® b is the tensor product between the nth order tensor a and the mth order tensor b 
defined as (a ® b)c = (b • c)a for every mth order tensor c; A • B = fr (AB T ) for tensors A and B. 
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2.2. CONSTITUTIVE RELATIONS 



The balance laws (5) and (6) arc to be supplemented by constitutive relations; we 
express these in terms of the internal energy. We presume that the mixture is at 
a uniform temperature, the constituents arc deformed quasistatically so that their 
kinetic energy can be neglected, and no energy is dissipated. The continuum model 
we use in order to describe deformations of a solid matrix saturated with a fluid 
must account for the strain energy associated with the gradient of the mass density 
of the solid. At the macroscopic level, the dependence of the strain energy on the 
gradient of deformation describes the effects of the opening of neighboring pores 
on the pore cluster which is modelled as a solid material point. We assume that 
the internal energy density can be split into two parts: a paid that depends upon 
the “local” deformation of the solid and the fluid particles and another paid that 
depends upon a “nonlocal” measure of deformation of the solid particles; the latter 
is taken to be proportional to |Vp®| 2 . Thus we write the balance of energy as 




which because of (4) implies that 



W)l 



dV 



j t J^p e(p®,F®, X®) + ^|Vp ( 

= f (m (s) • v (s) + m® • v (f) + T® • Vv (s) - p® div v® 

Ja 

+ n (s) • Wv (s) ) dv. 



(13) 



(14) 



Here F® = Gradx = dx/dX is the deformation gradient for the solid, a s > 0 is 
a material parameter with units of Newton m 6 /kg 2 , and d/d t signifies the material 
time derivative following the mean motion of the mixture. 

By using the Reynolds transport theorem the left hand side of equation (14) can 
be represented as a 1 i near functional of the velocity field of the two constituents. 
Thus the following constitutive equations for the partial Cauchy stresses T®, the 
solid-fluid interaction forces m ( ®, a = s, f, and the second-order stress II® 
associated with the solid constituent must hold: 



'■p(s) 



p 



(f) 



9F® 



r(s)T 



7 s 



(1 + £®)I + Vp [S> ® Vp 
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y/“U>. 



(15) 



(16) 
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n (s) 



m 



(s) 



-A s /5 (s) r 



-m (f) = -p 



Vp (s \ 



>(VF (S) ) 



3F (s) 



+ £ W F' 



(f)ir(s)— T 



de 

3XW 



(17) 



2 P \ 



(18) 



where 

f s = Vp (s) • Vp (s) , (19) 

and t; (f) is the mass fraction of the fluid phase. Note that the partial Cauchy stress 
tensor T (s) is symmetric and 3e/3F (s) equals the partial first Piola-Kirchhoff stress 
tensor of the solid constituent. Equations (15)— (18) arc derived in [34] where Ger- 
main’s [23] arguments arc used to obtain constitutive relations for a second gradient 
porous matrix filled with an ideal fluid. Equation (13) does not include all features 
of a general second gradient linear elastic matrix; only density gradients have been 
assumed to affect the internal potential energy and contributions of other compo- 
nents of the third order tensor. Grad F (s) , have not been considered. Furthermore, 
we only analyze static deformations of the mixture. Thus Darcy-type drag forces 
are not modeled. 



2.3. SPLITTING OF EXTERNAL SURFACE TRACTIONS INTO PARTIAL 
TRACTIONS 

We consider problems for which b ls) = b (f) = 0, i.e., there arc only external surface 
tractions. In a physical problem, total surface tractions arc prescribed either on a 
part or on all of the boundary of the region 12. Here we require that these tractions 
be assigned on all of the boundary of the mixture in the current configuration. 
However, the solution of the boundary- value problem defined by equa- 
tions (5)— (12) requires that the partial surface tractions be specified. In order to 
find the partial tractions we assume the existence of a potential function such that 

f (V) . V W + tffl . y( f )+ T ( S ).— W 
J a a V 3« / 

= —[ f ext (x,p (s) ,p (f \ Vp (s) ) dV. (20) 

d t Jq. 

The external surface tractions for which equation (20) holds arc conservative. It 
is readily apparent that not all conservative surface tractions arc characterized 
by equation (20). Here we consider only those surface tractions which satisfy 
equation (20) and \// cxl depends upon deformations of the solid only through p (s) 
and Vp (s) . Equation (20) is dictated by the intended application of studying static 
deformations of an annular cylindrical porous region filled with an inviscid fluid. 




